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SPLEEN TYROSINE KINASE CATALYTIC DOMAIN: 
CRYSTAL STRUCTURE AND BINDING POCKETS THEREOF 



[0001] This application claims benefit of United 
States Provisional Application No. 60/419,382, filed 
October 16, 2002, the disclosure of which is incorporated 
herein by reference. 



TECHNICAL FIELD OF THE INVENTION 
[0002] The present invention relates to molecules or 
molecular complexes which comprise binding pockets of the 
catalytic domain of Spleen Tyrosine Kinase protein 

10 (Sykcat) and its homologues . The present invention 

provides a computer comprising a data storage medium 
encoded with the structure coordinates of such binding 
pockets. This invention also relates to methods of using 
the structure coordinates to solve the structure of 

15 homologous proteins or protein complexes. In addition, 
this invention relates to methods of using the structure 
coordinates to screen for and design compounds, including 
inhibitory compounds, that bind to Sykcat or homologues 
thereof. The invention also relates to crystallizable 
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compositions and crystals conqprising Sykcat protein or 
Sykcat protein complexes . 



Background of the Invention 
5 [0003] Syk (Spleen tyrosine kinase) is a cytoplasmic 
72 kDa (-630 amino acids) protein- tyrosine kinase (PTK) 
that was originally foiind to be expressed in the spleen 
and thymus (Zioncheck et al-, J". Appl. Cryst. 263, pp. 
19195-19202 (1988))- It is widely expressed in a variety 

10 of hematopoietic cells, including B- and T-cells at 

various stages of development. Syk belongs to a family 
of non-receptor PTKs which also includes Zap70 protein- 
tyrosine kinase (ZAP-70) , a PTK implicated in T-cell 
receptor signaling (Chan et al . , J*. Immunology, 152, pp. 

15 4758-4766 (1994)). Human Syk has 93% amino acid homology 
to porcine Syk, greater than 90% amino acid sequence 
identity with murine Syk, and 73% identity to h\iman ZAP- 
70 (Law et al., J. Biol. Chem. , 269, pp. 12310-12319 
(1994); Furlong et al.^ Biochlm. Biophys. Acta, 1355, pp. 

20 177-190 (1997)). Syk is more abundantly expressed than 
ZAP-70, whose expression is restricted to T- and natural 
killer (NK-) cells (Chan et al., i7. Immunology, 152, pp. 
4758-4766 (1994)). 

[0004] Immune receptors, including the T- and B-cell 
25 receptors (TCR and BCR, respectively) , control lyrtqphocyte 
development through the combination of the responses of 
native and acquired immunity (Alberola-Ila et al., Annu. 
Rev. Immunol., 15, pp. 317-404 (1997); Reth & Wienands 
Annu. -Rev. Jmmunol . , 15, pp. 453-479 (1997); Turner et 
30 al.. Immunology Today, 3, pp. 148-154 (2000)). The 
activation of various PTKs is an early and essential 
event in the transduction of signals from immune 
receptors . 
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[0005] Gene- targeting studies have demonstrated an 
essential role for Syk in signal transduction via immune 
receptors (Turner et al., Immunologry Today, 3, pp. 148- 
154 (2 000) ) . Src family kinases phosphorylate 
5 Immunoreceptor tyrosine-based activation motifs (ITAMs) 
on the cytoplasmic tail of cell-surface receptors, 
creating docking sites for the SH2 domains of Syk. 
Binding of the tandem SH2 domains in Syk to the 
phosphorylated ITAMs leads to activation of Syk and 
10 thereby to signal transduction (Bolen & Brugge, Annu. 
J^ev. Jjnznunol., 15, pp: 371-404, (1997); Steele et al . , 
Gene, 239, pp. 91-97 (1999)). 

[0006] Directed disruption of the Syk gene in knockout 
mice leads to embryonic hemorrhage and death (Cheng et 

15 al., l\7;atujre, 378, pp. 303-306 (1995); Turner et al., 

Nature, 378, pp. 298-302 (1995)). In addition, Syk is 
commonly expressed in normal human breast tissue, benign 
breast lesions, and low-tumorigenic breast cancer cell 
lines (Coopman et al., iy^ature, 406, pp. 742-747 (2000)). 

20 In cancerous breast tissue or cell lines, Syk mRNA and 
protein levels are low or undetectable, suggesting that 
loss of Syk expression may be associated with the 
development of a malignant phenotype in breast cancer 
(Coopman et al . , 2\rature, 406, pp. 742-747 (2000)). 

25 Introduction of wild type Syk into a Syk-knockout breast 
cancer cell line potently inhibited tiimor growth and 
metastasis in athymic mice. Conversely, overexpression 
of a Syk protein that has been mutated to have no kinase 
activity in a Syk-positive breast cancer cell line 

30 markedly increased its tumor incidence and growth. Tumor 
incidence and growth was retarded by reintroduction of 
wildtype Syk. Thus, Syk appears to be an important 
feature of epithelial cell growth control and a potential 
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tumor suppressor in human breast cancers (Coopman et al . , 
Nature, 406, pp. 742-747 (2000)). 

[0007] Syk family tyrosine kinases contain tandem N- 
terminal SH2 domains and a C-terminal catalytic kinase 
5 domain. These domains are separated by a linker 

region", designated inter-domain B. As discussed above, 
the SH2 domains bind phospho tyrosines in ITAMs. The 
linker region contains multiple tyrosine residues that, 
upon phosphorylation, act as docking sites for other 
10 proteins such as phospholipase Cyl (PLCyD # VAV and CBL, 
all of which are possible Syk substrates (Sillman and 
Monroe, J. Biol. Chem. , 270, pp. 11806-11811 (1995); 
Furlong et al., Biochim, Biophys. Acta, 1355, pp. 177-190 

(1997) ; Law et al . , Mol. Cell. Biol., 16, pp. 1305-1315 
15 1996)) . Syk does not contain an SH3 domain or a 

membrane -spanning region. The kinase (catalytic) and SH2 
domains show 25-40% identity in sequence to PTKs in other 
families, but the intervening sequences, including linker 
regions, are unique. 
20 [0008] The crystal structure of the regulatory SH2 

domains of Syk bound to a phospho ry la ted ITAM peptide was 
solved by multiple isomorphous replacement at 3.0 A 
(Futterer et al., J". Mol. Biol., 281, pp. 523-537 

(1998) ) . The two SH2 domains and an intervening region 
25 which connects them together form a Y-shaped molecule. 

Both SH2 domains fold in a similar manner to other SH2 
domains, each of which contain a large P-sheet flanked by 
two a-helices (Kuriyan & Cowburn, Curr. Opin. Struct. 
Biol., 3, pp. 828-837 (1993)). 
30 [0009] Regulation of Syk protein expression has been 
implicated as a strategy for treatment of breast cancer 
(Stewart & Pietenpol, Breast Cancer Res., 3, pp. 5-7 
(2001)), leukemia (Goodman, et al . , Oncogene, 20, pp. 
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3969-78 (2001)), asthma (Yamada, et al., j. immumol., 
167, pp. 283-8 (2001)), Systemic Lupus Erythematosus 
(SLE) (Liossis, et al . , J". Investig. Med., 49, pp. 157-65 
(2001) ) , and other inflammatory diseases (Malaviya, et 
5 al.. Am. J. Ther. , 8, pp. 417-24 (2001)). 

[0010] As no structural information on the catalytic 

domain of Syk or ZAP-7 0 has been available, detailed 
information about the catalytic ATP-binding site and the 
substrate binding site has been absent. It would be 

10 desirable to have detailed structural models of the 
catalytic domain of Syk to screen for, design and 
optimize drugs to modulate Syk and treat diseases 
including cancer, asthma. Systemic Lupus Erythematosus 
(SLE) and other inflammatory diseases. Additional 

15 information about the kinase mechanism of Syk also would 
be revealed by a structure of the catalytic domains of 
the enzyme with a substrate or inhibitor. 

SUMMARY OF THE INVENTION 
20 [0011] The present invention provides, for the first 
time, the crystal structures of complexes of the 
catalytic domain of Syk (Sykcat) and methods of using 
these crystal structures for drug design and discovery. 
[0012] The present invention also provides crystalline 
25 molecules or molecular complexes comprising Sykcat binding 
pockets, or Sykcat-like binding pockets that have similar 
three-dimensional shapes. In one embodiment, the 
crystalline molecules or molecular complexes are Syk 
proteins, Sykcat proteins, Syk or Sykcat protein complexes 
30 or homologues thereof. The invention provides crystal 
compositions comprising Sykcat protein, Sykcat protein 
conplex, or homologues thereof in the presence or absence 
of a chemical entity. The invention also provides a 
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method of crystallizing Sykcat protein, Sykcat protein 
complex, or homologues thereof. 

[0013] The invention further provides a computer 
comprising a data storage medium that comprises the 
5 structure coordinates of molecules and molecular 

complexes comprising all or part of the Sykcat or Sykcat- 
like binding pocket. Such storage medi\im, when read and 
utilized by a computer programmed with appropriate 
software, displays on a computer screen or similar 
10 viewing device, a three-dimensional graphical 

representation of the molecule or molecular coinplex 
comprising such binding pockets. 

[0014] The invention provides methods for screening, 
designing, optimizing, evaluating and identifying 
15 compounds or chemical entities that bind to the molecules 
or molecular complexes or their binding pockets. The 
methods can be used to identify agonists and antagonists 
of Syk and its homologues. 

[0015] The invention also provides a method for 
20 determining at least a portion of the three-dimensional 
structure of molecules or molecular complexes which 
contain some structurally similar features to Syk, 
particularly Sykcat homologues. This is achieved by using 
at least some of the structure coordinates obtained from 
25 the Sykcat complexes. 

BRIEF DESCRIPTION OF THE DRAWINGS 
[0016] The following abbreviations are used in Figures 
1 and 2: 

30 ^'Atom type" refers to the element whose 

coordinates are measured. The first letter in the column 
defines the element. 
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"Resid" refers to the amino acid residue 

identity. 

'"X, define the atomic position of the 

element measured. 
5 '"B" is a thermal factor that measures movement 

of the atom around its atomic center. 

''Occ" is an occupancy factor that refers to the 
fraction of the molecules in which each atom occupies the 
position specified by the coordinates. A value of ^^l" 
10 indicates that each atom has the same conformation, i.e., 
the same position, in all molecules. 

''Mol" refers to the molecule in the asymmetric 

unit . 

[0017] Figure lA (lA-1 to lA-88) lists the atomic 

15 structure coordinates for the Sykcat (amino acid residues 
358-405 and 411-635 of full-length human Syk protein (SEQ 
ID NO: 1)) complexed with staurosporine at a resolution 
of 1.65 A as derived by X-ray diffraction from the 
crystal . The second line for each atom gives the values 

20 from anisotropic B factor refinement for that atom. The 
coordinates are listed in Protein Data Bank (PDB) format. 
Residues STU B, HOH W represent staurosporine and water, 
respectively. Amino acid residues identified as C 
residues are part of N- terminal end of the neighboring 

25 Sykcat molecule 

[0018] Figure 2A (2A-1 to 2A-39) lists the atomic 
structure coordinates for the Sykcat"PT426-adenylyl 
imidodiphosphate (AMP-PNP) complex at 2.4 ^A. as derived by 
X-ray diffraction from the crystal. The Sykcat model 

30 contains amino acid residues 364-380, 383-404, and 412- 
634 of full-length Syk protein (SEQ ID NO: 1) . The 
coordinates are listed in Protein Data Bank (PDB) format. 
Residues PTR, ANP B, HOH W, and MG M represent 
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phosphorylated tyrosine, adenylyl imidodiphosphate, 
water, and magnesium ion, respectively- Amino acid 
residues identified as "C" are from the peptide PT426 
(SEQ ID NO: 2) . 
5 [0019] Figure 3 depicts a ribbon diagram of the 
overall fold of the Sykcat-staurosporine complex. 
Staurosporine is shown in stick representation. The a- 
helices and (5-strands are labeled as aC-al and pi-pll, 
respectively . 

10 [0020] Figure 4 depicts a ribbon diagram of the 

overall fold of the Sykcat-PT426-AMP-PNP complex. PT426 
and AMP-PNP are shown in stick representation. The a- 
helices and p-strands are labeled as aC~aI and pl-pil, 
respectively . 

15 [0021] Figure 5 shows a detailed representation of 
pockets in the Sykcat-staurosporine complex. 
Staurosporine is shown in stick representation and Sykcat 
is shown as a ribbon. Contacts between Sykcat and 
staurosporine are represented by dashed lines. 

20 [0022] Ficrure 6 shows a detailed representation of the 
substrate binding pocket in the Sykcat-PT42 6 -AMP-PNP 
complex. PT426 is shown in stick representation and 
Sykcat is shown as a ribbon. Contacts between Sykcat and 
staurosporine are represented by dashed lines. 

25 [0023] Figure 7 shows a diagram of a system used to 

carry out the instructions encoded by the storage mediiim 
of Figures 8 and 9. 

[0024] Figure 8 shows a cross section of a magnetic 
storage medium. 

30 [0025] Figure 9 shows a cross section of an optically- 
readable data storage medixim. 
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DETAILED DESCRIPTION OF THE INVENTION 
[0026] In order that the invention described herein 
may be more fully understood, the following detailed 
description is set forth. 
5 [0027] Throughout the specification, the word 
^'comprise", or variations such as '^comprises" or 
**comprising" will be understood to imply the inclusion of 
a stated integer or groups of integers but not exclusion 
of any other integer or groups of integers . 
10 [0028] The following abbreviations are used throughout 
the application: 



A 




Ala 




Alanine 


T 




Thr 




Threonine 


V 




Val 




Valine 


C 




Cys 




Cysteine 


L 




Leu 




Leucine 


Y 




Tyr 




Tyrosine 


I 




He 




Isoleucine 


N 




Asn 




Asparagine 


P 




Pro 




Proline 


Q 




Gin 




Glutamine 


F 




Phe 




Phenylalanine 


D 




Asp 




Aspartic Acid 


W 




Trp 




Tryptophan 


E 




Glu 




Glutamic Acid 


M 




Met 




Methionine 


K 




Lys 




Lysine 


6 




Gly 




Glycine 


R 




Arg 




Arginine 


S 




Ser 




Serine 


H 




His 




Histidine 



[0029] As used herein, the following definitions shall 
15 apply unless otherwise indicated. 

[0030] The term '^abouf when used in the context of 
root mean square deviation or RMSD values takes into 
consideration the standard error of the RMSD value, which 
is ± 0.1 A. 

20 [0031] The term '^associating with" refers to a 

condition of proximity between a chemical entity or 
compound, or portions thereof, and a binding pocket or 
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binding site on a protein. The association may be non- 
covalent wherein the juxtaposition is energetically 
favored by hydrogen bonding, hydrophobic, van der Waals 
or electrostatic interactions or it may be covalent. 
5 [0032] The term ''ATP analogue" refers to a compound 
derived from adenosine-5 ' -triphosphate (ATP). The 
compound can be adenosine, AMP, ADP, or a non- 
hydrolyzable analogue, such as, but not limited to 
adenylyl imidodiphosphate (AMP-PNP) . The analogue may be 
10 in complex with magnesium or manganese ions. 

[0033] The term ''binding pocket" refers to a region of 
a molecule or molecular complex, that, as a result of its 
shape, favorably associates with another chemical entity 
or coinpound. The term "pocket" includes, but is not 
15 limited to, a cleft, channel or site or some combination 
thereof . Sykcat or Sykcat-like molecules may have binding 
pockets which include, but are not limited to, peptide or 
substrate binding sites, and ATP-binding sites. 
[0034] The term "catalytic active site" or "active 
20 site" refers to the portion of the protein kinase to 
which nucleotide substrates bind. For example, the 
catalytic active site of Sykcat is at the interface 
between the N- terminal, p-strand lobe or sub-domain and 
the C- terminal, a-helical lobe or sub-domain, 
25 [0035] The term "catalytic domain of Syk" or "Syk 
catalytic domain" refers to the kinase domain of the 
human Syk molecule. This domain is located at the C- 
terminal end of the Syk protein. (See, Latour et al., 
EMBO J., 17, pp. 2584-2595 (1998)). The domain includes, 
30 for example, the catalytic active site comprising the 
catalytic residues. The domain in the Syk protein 
coitqprises amino acid residues from about 343 to 639. 
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[0036] The term "chemical entity" refers to chemical 
compounds/ complexes of at least two chemical compounds, 
and fragments of such compounds or complexes. The 
chemical entity can be, for example, a ligand, substrate, 
5 nucleotide triphosphate, nucleotide diphosphate, 

phosphate, nucleotide, agonist, antagonist, inhibitor, 
antibody, peptide, protein or drug. In one embodiment, 
the chemical entity is an inhibitor or substrate for the 
active site. 

10 [0037] The term "complex" or "molecular complex" 

refers to a protein associated with a chemical entity. 
[0038] The term conservative substitutions" refers to 
residues that are physically or functionally similar to 
the corresponding reference residues. That is, a 

15 conservative substitution and its reference residue have 
similar size, shape, electric charge, and chemical 
properties, including the ability to form covalent or 
hydrogen bonds, or the like. Preferred conservative 
substitutions are those fulfilling the criteria defined 

20 for an accepted point mutation in Dayhoff et al.. Atlas 

of Protein Sequence and Structure, 5, pp. 345-3 52 (1978 & 
Supp.), which is incorporated herein by reference. 
Examples of conservative substitutions are substitutions 
including but not limited to the following groups: (a) 

25 valine, glycine; (b) glycine, alanine; (c) valine, 

isoleucine, leucine; (d) aspartic acid, glutamic acid; 
(e) asparagine, glutamine; (f) serine, threonine; (g) 
lysine, arginine, methionine; and (h) phenylalanine, 
tyrosine. 

30 [0039] The term **contact score" refers to a measure of 
shape complementarity between the chemical entity and 
binding pocket, which is correlated with an RMSD value 
obtained from a least square super iitqpos it ion between all 
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or part of the atoms of the chemical entity and all or 
part of the atoms of the ligand bound (for example, AMP- 
PNP, staurosporine, PT426) in the binding pocket 
according to Figure 1 or 2 . The docking process may be 
5 facilitated by the contact score or RMSD values. For 
example, if the chemical entity moves to an orientation 
with high RMSD, the system will resist the motion. A set 
of orientations of a chemical entity can be ranked by 
contact score. A lower PMSD value will give a higher 
10 contact score. See Meng et al. J. Comp. Chem. , 4, 505- 
524 (1992) . 

[0040] The term "correspond to" or "corresponding 
amino acids" when used in the context of amino acid 
residues that correspond to Syk amino acid residues 

15 refers to particular amino acid residues or analogues 

thereof in a Syk homologue that correspond to amino acid 
residues in the Syk protein. The corresponding amino 
acid may be an identical, mutated, chemically modified, 
conserved, conservatively substituted, functionally 

20 equivalent or homologous amino acid when compared to the 
Syk amino acid residue to which it corresponds . 
[0041] Methods for identifying a corresponding amino 
acid are known in the art and are based upon secpience 
alignment, structural alignment, similarities in 

25 biochemical or structural function, or a combination 
thereof as compared to the Syk protein. For example, 
corresponding amino acid residues may be identified by 
superimposing the backbone atoms of the amino acid 
residues in Syk and the protein using well known software 

30 applications, such as QUANTA (Molecular Simulations, 

Inc., San Diego, CA ©2000). The corresponding amino acid 
residues may also be identified using sequence alignment 
programs such as the "bestfit" program, available from 
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the Genetics Coinputer Group which uses the local homology 
algorithm described by Smith and Waterman in Advances in 
Applied Mathematics 2, 482 (1981), which is incorporated 
herein by reference, or CLUSTAL W Alignment Tool, supra. 
5 [0042] The term '"crystallization solution" refers to a 
solution that promotes crystallization comprising at 
least one agent, including a buffer, one or more salts, a 
precipitating agent, one or more detergents, sugars or 
organic compounds, lanthanide ions, a poly-ionic 
10 compound, and/or a stabilizer. 

[0043] The term "docking" refers to orienting, 
rotating, translating a chemical entity in the binding 
pocket, domain, molecule or molecular complex or portion 
thereof. Docking may be performed by distance geometry 
15 methods that find sets of atoms of a chemical entity that 
match sets of sphere centers of the binding pocket, 
domain, molecule or molecular complex or portion thereof. 
See Meng et al. J. Comp. Chem. , 4, 505-524 (1992). 
Sphere centers are generated by providing an extra radius 
20 of given length from the atoms (excluding hydrogen atoms) 
in the binding pocket, domain, molecule or molecular 
complex or portion thereof. Real-time interaction enercfy 
calculations, energy minimizations or rigid-body 
minimizations (Gschwend, et al., J. Mol . Recognition, 
25 9:175-186 (1996)) can be performed while orienting the. 
chemical entity to facilitate docking. For example, 
interactive docking experiments can be designed to follow 
the path of least resistance thereby simulating an 
interactive energy minimization. If the user in an 
30 interactive docking experiment makes a move to increase 
the energy, the system will resist that move. However, 
if that user makes a move to decrease energy, the system 
will favor that move by increased responsiveness. (Cohen, 
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et al., J. Med- Chem. 33:889-894 (1990)). Docking can 
also be performed by combining a Monte Carlo search 
technique with rapid energy evaluation using molecular 
affinity potentials. See Goodsell and Olson, Proteins: 
5 Structure, Function and Genetics 8:195-202 (1990). 
Software programs that carry out docking functions 
include but are not limited to MATCHMOL (Cory et al . , J. 
Mol. Graphics, 2, 39 (1984); MOLFIT (Redington, Cojnput- 
Chem., 16, 217 (1992)) and DOCK (Meng et al., supra). 

10 [0044] The term "'domain'' refers to a structural \init 

of the Syk protein or homologue. The domain can coinprise 
a binding pocket, a sequence or structural motif. 
[0045] The term full-length Syk" refers to the 
complete hximan Syk protein (amino acids residues 1 to 

15 635; GenBank accession nximber A53596; SEQ ID NO:l), which 
includes N-terminal tandem SH2 domains linked to a C- 
terminal catalytic domain. 

[0046] The term "generating a three-dimensional 
structure" or "generating a three-dimensional 

20 representation" refers to converting the lists of 

structure coordinates into structural models or graphical 
representation in three-dimensional space. This can be 
achieved through commercially or publicly available 
software. A model of a three-dimensional structure of a 

25 molecule or molecular complex can thus be constructed on 
a computer screen by a computer that is given the 
structure coordinates and that coinprises the correct 
software. The three-dimensional structure may be 
displayed or used to perform computer modeling or fitting 

30 operations. In addition, the structure coordinates 

themselves, without the displayed model, may be used to 
perform computer-based modeling and fitting operations. 
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[0047] The term ""homologue of Sykcat" or ^'Sylccat 
homologue" or "Syk catalytic domain homologfue" refers to a 
molecule that comprises a domain having at least 70%, 
80%, 90%, 95%, 96%, 97%, 98%, 99% or greater than 99% 
5 sequence identity to the catalytic domain of Syk. 

Examples of homologues include but are not limited to 
human Syk or SykB, Syk, SykB or the catalytic domain 
thereof from another species, with mutations, 
conservative substitutions, additions, deletions or a 

10 combination thereof. In one embodiment, the homologue 

comprises a domain having at least 95%, 96%, 97% , 98% or 
99% sequence identity to the catalytic domain of Syk, and 
has conservative mutations as compared to the catalytic 
domain of Syk. The homologue can be Syk, SykB or the 

15 catalytic domain thereof from another animal species. 
Such animal species include, but are not limited to, 
mouse, rat, a primate such as monkey or other primates. 
[0048] The term "homology model" refers to a 
structural model derived from known three-dimensional 

20 structure (s) . Generation of the homology model, termed 
"homology modeling", can include sequence alignment, 
residue replacement, residue conformation adjustment 
through energy minimization, or combination thereof. 
[0049] The term "interaction energy" refers to the 

25 energy determined for the interaction of a chemical 
entity and a binding pocket, domain, molecule or 
molecular complex or portion thereof. Interactions 
include but are not limited to one or more of covalent 
interactions, non-covalent interactions such as hydrogen 

30 bond, electrostatic, hydrophobic, aromatic, van der Waals 
interactions, and non-complementary electrostatic 
interactions such as repulsive charge- charge, dipole- 
dipole and charge-dipole. As interaction energies are 
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measured in negative values, the lower the value the more 
favorable the interaction. 

[0050] The term '*motif " refers to a group of amino 
acid residues in the Sykcat protein or homologue that 
5 defines a structural compartment or carries out a 
function in the protein, for example, catalysis, 
structural stabilization, or phosphorylation. The motif 
may be conserved in sequence, structure and function. 
The motif can be contiguous in primary sequence or three- 

10 dimensional space. Exait^les of a motif include but are 
not limited to the phosphorylation lip or activation 
loop, the glycine-rich phosphate anchor loop, the 
catalytic loop, the DF6 or DFGWSxxxxxxxRxTxCGTxDYLPPE 
loop (see, Xie et al., Structure, 6 pp. 983-991 (1998); 

15 Giet and Prigent, J. Cell. Sex., 112, pp. 3591-601 
(1991)) and the degradation box. 

[0051] The term ''part of a binding pocket refers to 
less than all of the amino acid residues that define the 
binding pocket. The structure coordinates of residues 

20 that constitute part of a binding pocket may be specific 
for defining the chemical environment of the binding 
pocket, or useful in designing fragments of an inhibitor 
that may interact with those residues. For example, the 
portion of residues may be key residues that play a role 

25 in ligand binding, or may be residues that are spatially 
related and define a three-dimensional compartment of the 
binding pocket. The residues may be contiguous or non- 
contiguous in primary sequence. In one embodiment, part 
of a binding pocket has at least two amino acid residues, 

30 preferably at least four, six or eight amino acid 
residues . 

[0052] The term ''part of a Sykcat protein" or "part of 
a Sykcat homologue" refers to less than all of the amino 
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acid residues of a Sykcat protein or homologue. In one 
embodiment, part of a Sykcat protein or homologue defines 
the binding pockets, domains, sub-domains, and motifs of 
the protein or homologue. The structure coordinates of 
5 residues that constitute part of a Sykcat protein or 
homologue may be specific for defining the chemical 
environment of the protein, or useful in designing 
fragments of an inhibitor that may interact with those 
residues. The portion of residues may also be residues 

10 that are spatially related and define a three-dimensional 
compartment of a binding pocket, motif or domain. The 
residues may be contiguous or non-contiguous in primary 
sequence. For exaitple, the portion of residues may be 
key residues that play a role in ligamd or substrate 

15 binding, peptide binding, antibody binding, catalysis, 
structural stabilization or degradation. 

[0053] The term ^^root mean sc[uare deviation" or ^^RMSD'' 
means the square root of the arithmetic mean of the 
squares of the deviations from the mean. It is a way to 

20 express the deviation or variation from a trend or 

object. For purposes of this invention, the ^^root mean * 
square deviation" defines the variation in the backbone 
atoms of a protein from the backbone atoms of Sykcat # a 
binding pocket, a motif, a domain, or portion thereof, as 

25 defined by the structure coordinates of Sykcat described 
herein. It would be apparent to the skilled worker that 
the calculation of RMSD involves a standard error of ± 
0.1 A. 

[0054] The term ^'soaked* refers to a process in which 
30 the crystal is transferred to a solution containing a 
compound of interest. 

[0055] The term '^structure coordinates'* refers to 
Cartesian coordinates derived from mathematical equations 
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related to the patterns obtained from diffraction of a 
monochromatic beam of X-rays by the atoms (scattering 
centers) of a protein or protein complex in crystal form. 
The diffraction data are used to calculate an electron 
5 density map of the repeating unit of the crystal. The 
electron density maps are then used to establish the 
positions of the individual atoms of the molecule or 
molecular complex. 

[0056] The term ^'sub-domain" refers to a portion of 
10 the domain as defined above in the Syk protein or 

homologue. The catalytic domain (approximately amino 
acid residues 343-639) of Syk is a bi-lobal structure 
consisting of an N- terminal, ^-strand sub-domain or lobe 
and a C- terminal, a-helical sub-domain or lobe. 
15 [0057] The term "substantially all of a Sykcat binding 
pocket" or "siibstantially all of a Sykcat protein" refers 
to all or almost all of the amino acid residues in the 
Sykcat binding pocket or protein. For example, 
substantially all of a Sykcat binding pocket can be 100%, 
20 95%, 90%, 80%, or 70% of the residues defining the Sykcat 
binding pocket or protein. 

[0058] The term '^substrate binding pocket" refers to 
the binding pocket for a substrate of Sylccat or homologue 
thereof. A substrate is generally defined as the 

25 molecule upon which an enzyme performs catalysis. 

Natural substrates, synthetic substrates or peptides, or 
mimics of a natural substrates of Sykcat or homologue 
thereof may associate with the substrate binding pocket. 
[0059] The term '*suf f iciently homologous to Sykcat" 

30 refers to a protein that has a sequence identity of at 
least 25% compared to Sykcat protein. In other 
embodiments, the seqnaence homology is at least 40%. In 
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other eiribodiments, the sequence identity is at least 50%, 
60%, 70%, 80%, 90%, 95%, 96%, 97%, 98% or 99%. 
[0060] The term '"Sykcat" or ''Sykcat protein" refers to 
the catalytic domain of h-uman Syk. 
5 [0061] The term ''SykcatB" or ''SykcatB protein" refers to 
the catalytic domain of SykB, a less common form of Syk 
protein that lacks a twenty-three amino acid residue 
insert in the linker region present in Syk. 
[0062] The ^'Sykcat ATP-binding pocket refers to a 

10 binding pocket of a molecule or molecular complex defined 
by the structure coordinates of a certain set of cimino 
acid residues present in the Sykcat structure, as 
described below. In general, the ligand for the ATP- 
binding pocket is a nucleotide such as ATP. This binding 

15 pocket is in the catalytic active site of the catalytic 
domain. In the protein kinase fcunily, the ATP-binding 
pocket is generally located at the interface of the a- 
helical and (^-strand sub-domains, and is bordered by the 
glycine rich loop and the hinge (See, Xie et al . , 

20 Structure, 6, pp. 983-991 (1998) , incorporated herein by 
reference) . 

[0063] The term ^'Sykcat-like" refers to all or a 
portion of a molecule or molecular complex that has a 
commonality of shape to all or a portion of the Sy]ccat 

25 protein. For example, in the Sykcat-lil^e ATP-binding 
pocket, the commonality of shape is defined by a root 
mean square deviation of the structure coordinates of the 
backbone atoms between the amino acid residues in the 
Sykcat-like ATP-binding pocket and the amino acid residues 

30 in the Sykcat ATP-binding pocket (as set forth in Figures 
1 or 2) . Compared to an amino acid in the Sylccat ATP- 
binding pocket, the corresponding amino acid residues in 
the Sykcat- like ATP-binding pocket may or may not be 
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identical . Depending on the Sykcat amino acid residues 
that define the Sykcat-ATP binding pocket, one skilled in 
the art would be able to locate the corresponding amino 
acid residues that define a Sykcat- like- ATP binding pocket 
5 in a protein based upon sequence or structural homology. 
[0064] The term ''Sykcat protein complex" or "'Sykcat 
homologue complex" refers to a molecular complex formed 
by associating the Sykcat protein or Sykcat homologue with 
at least a chemical entity, for example, a ligand, a 

10 substrate, nucleotide triphosphate, nucleotide 

diphosphate, phosphate, an agonist or antagonist, 
inhibitor, antibody, drug or compound. In one 
embodiment, the chemical entity is selected from the 
group consisting of ATP, an ATP analogue, a nucleotide 

15 triphosphate and ATP-binding pocket inhibitor. In 
another embodiment, the chemical entity is an ATP 
analogue such as Mg-AMP-PNP, or adenosine. Mg refers to 
Mg"*^^. In one embodiment, the chemical entity is PT426 or 
staurosporine . 

20 [0065] The term "three-dimensional structural 

information" refers to information obtained from the 
structure coordinates. Structural information generated 
can include the three-dimensional structure or graphical 
representation of the structure. Structural information 

25 can also be generated when subtracting distances between 
atoms in the structure coordinates, calculating chemical 
energies for a Sykcat molecule or molecular complex or 
homologues thereof, calculating or minimizing energies 
for an association of a Sykcat molecule or molecular 

30 complex or homologues thereof to a chemical entity. 
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Crystallizable Compositions and Crystals of Sykcat Protein 
and Protein Contplexes 

[0066] According to one ernbodiment , the invention 
5 provides a crystal or crystal composition comprising a 
catalytic domain of Syk protein (Sykcat) or homologue 
thereof in the presence or absence of a chemical entity. 
The catalytic domain of Syk protein may be phosphorylated 
or unphosphorylated. In one embodiment, the chemical 

10 entity binds to the active site. In one embodiment, the 
chemical entity is selected from the group consisting of 
an ATP analogue, ATP, adenosine, AMP-PNP, nucleotide 
triphosphate, nucleotide diphosphate, phosphate, 
staurosporine, an agonist, an antagonist and an active 

15 site inhibitor. In one embodiment, the chemical entity 
is staurosporine. In another embodiment, the chemical 
entity is AMP-PNP. In one embodiment, the chemical 
entity binds to the substrate binding pocket. In one 
embodiment, the chemical entity is selected from the 

20 group consisting of NAc~Glu-Glu-Asp~Asp-Tyr-Glu-Ser-Pro- 

NH2 (PT426) (SEQ ID N0:2), Glu-Glu-Asp-Asp-Tyr-Glu-Ser-Pro 
(SEQ ID NO: 5), a peptide comprising the amino acid 
sequence Glu-Asp-Asp-Tyr (residues 2-5 of SEQ ID NO: 5), a 
peptide comprising the amino acid sequence Asp-Asp-Tyr- 

25 Glu (residues 3-6 of SEQ ID NO: 5), a peptide comprising 
the amino acid sequence Asp-Tyr-Glu-Ser (residues 4-7 of 
SEQ ID NO: 5), a peptide comprising the amino acid 
sequence Tyr-Glu-Ser-Pro (residues 5-8 of SEQ ID NO: 5), a 
peptide comprising the amino acid sequence Glu-Glu-Asp- 

30 Asp-Tyr (residues 1-5 of SEQ ID NO: 5), a peptide 

comprising the cunino acid sequence Glu-Asp-Asp-Tyr-Glu 
(residues 2-6 of SEQ ID N0:5), a peptide comprising the 
amino acid sequence Asp- Asp-Tyr-Glu-Ser (residues 3-7 of 
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SEQ ID NO: 5), a peptide comprising the amino acid 
sequence Asp-Tyr-Glu-Ser-Pro (residues 4-8 of SEQ ID 
NO: 5), a peptide comprising amino acids Asp-Glu-Glu-Asp- 
Tyr (SEQ ID NO: 6), a peptide comprising amino acids Asp- 
5 Glu-Glu-Tyr-Asp (SEQ ID N0:7), a peptide comprising amino 
acids Asp-Glu-Tyr-Glu~Asp (SEQ ID NO: 8), a peptide 
comprising amino acids Asp-Tyr-Glu-Glu-Val (SEQ ID NO: 9), 
and a peptide comprising amino acids Tyr-Ser-Ile-Ile-Nle 
(SEQ ID NO: 10) . Peptides with SEQ ID NOs 6-10 were 

10 screened and found to be preferred substrates for Syk in 
United States Patent Application Publication No. 
2002/0155503, incorporated herein by reference. 
[0067] In one embodiment, the crystal has unit cell 
dimensions of a= 39.45 A , b= 84.17 A, c= 85.00 A, a= 

15 90^, P= 90^, Y= 90° and belongs to space group P2i2i2i. 

Preferably, the crystal comprises the Sykcat-staurosporine 
complex. In another embodiment, the crystal has a unit 
cell dimension of a= 39.58 A , b= 84.67 A, c= 90.63 A, a= 

90 o, P= 90 ^, Y= 90^ and belongs to space group P2i2i2i. 

20 In one embodiment, the crystal comprises the Sykcat-PT426- 
AMP-PNP complex. It will be readily apparent to those 
skilled in the art that the unit cells of the crystal 
compositions may deviate ± 1-2 A from the above cell 
dimensions depending on the deviation in the unit cell 

25 calculations. 

[0068] As used herein, the Sykcat protein in the 
crystal may be amino acid residues 343-635, 358-635 or 
364-634 of SEQ ID NO : 1 or fragments of at least 100 of 
these amino acid residues thereof, or the foregoing with 

30 conservative substitutions, deletions or insertions. 
[0069] The Sykcat protein or its homologue may be 
produced by any well-known method, including synthetic 



wo 2004/035604 



PCT/US2003/032812 



23 

methods, such as solid phase, liquid phase and 
combination solid phase/liquid phase syntheses; 
recombinant DNA methods, including cDNA cloning, 
optionally combined with site directed mutagenesis; 
5 and/or purification of a natural product. In another 
embodiment, the protein is produced recombinant ly and 
overexpressed in a baculovirus system. 

[0070] The invention also provides a method of making 
a crystal comprising a catalytic domain of Syk protein or 
10 a homologue thereof in the presence or absence of a 

chemical entity. Such methods coitqprise the steps of: 

a. producing and purifying a catalytic domain 
of Syk protein or homologrue thereof; 

b. combining said catalytic domain of Syk 
15 protein, or a homologue thereof in the presence or 

absence of a chemical entity with a crystallization 
solution to produce a crystallizable composition; and 

c. subjecting said crystallizable composition 
to conditions which promote crystallization. In one 

20 embodiment, the chemical entity binds to the active site 
of said Syk protein. In another embodiment, the chemical 
entity binds to the substrate binding site. 
[0071] The crystallization solution may include, but 
is not limited to, polyethylene glycol (PEG) at between 5 

25 to 40 % v/v, 50-300 mM acetate, and a buffer that 
maintains pH at between about 4 . 0 and 7.0. In one 
embodiment, the crystallizable composition comprises 
equal volumes of a solution of Sykcat. 20 mM 
diethanolamine (pH 8.6), 500 mM NaCl and 300 mM 

30 staurosporine, and a solution of 20 % polyethylene glycol 
with average molecular weight 2000 (PEG 2K) , 0.2 ammonium 
acetate, 0.1 M sodium cacodylate (pH 5.23). In one 
embodiment, the crystallizable composition comprises 
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equal volumes of a solution of Sykcat (2-4 itig/mL) , 20 mM . 
diethanolamine (pH 8.6), 500 nM NaCl, 2 niM AMP-PNP, 6 rnM 
MgCl2 and 500 niM of the peptide NAc-Glu-Glu-Asp-Asp-Tyr- 
Glu-Ser-Pro-NH2 (SEQ ID NO: 2), and a solution containing 
5 22% PEG 2K, 0.2 M magnesiiom acetate, 0.1 M sodium 

cacodylate (pH 5.23) . In another embodiment, the Syk 
protein in the crystallizable composition is at least 95% 
pure . 

[0072] In another embodiment, the method of making 
10 crystals of Sykcat proteins, Sykcat protein complexes, or 
homologues thereof includes the use of a device for 
promoting crystallizations. Devices for promoting 
crystallization include but are not limited to hanging- 
drop, sitting-drop, sandwich- drop, dialysis, microbatch 
15 or microtube batch devices (U^S. Patents 4,886,646, 

5,096,676, 5,130,105, 5,221,410 and 5,400,741; Pav et 
al.. Proteins; Structure/ Function, and Genetics, 20, pp. 
98-102 (1994); Chayen, Acta. Cryst., D54, pp. 8-15 
(1998), Chayen, Structure, 5, pp. 1269-1274 (1997), 
20 D'Arcy et al . , J. Cryst. Growth, 168, pp. 175-180 (1996) 
and Chayen, J. Appl . Cryst., 30, pp. 198-202 (1997), 
incorporated herein by reference) . 

[0073] The hanging-drop, sitting-drop and some 
adaptations of the microbatch methods (D'Arcy et al., J. 

25 Cryst. Growth, 168, pp. 175-180 (1996) and Chayen, J. 

Appl. Cryst., 30, pp. 198-202 (1997)) produce crystals by 
vapor diffusion. The hanging drop and sitting drop 
containing the crystallizable composition is equilibrated 
against a reservoir containing a higher or lower 

30 concentration of precipitant. As the drop approaches 

equilibrium with the reservoir, the saturation of protein 
in the solution leads to the formation of crystals. 
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[0074] Microseeding or seeding may be used to obtain 
larger, or better quality (i.e., crystals with higher 
resolution diffraction or single crystals) crystals from 
initial micro-crystals. Microseeding involves the use of 
5 crystalline particles to provide nucleation under 

controlled crystallization conditions. In this instance, 
micro-crystals are crushed to yield a stock seed 
solution. The stock seed solution is diluted in series. 
Using a needle, glass rod, or strand of hair, a small 

10 sample from each diluted solution is added to a set of 
equilibrated drops containing a protein concentration 
equal to or less than a concentration needed to create 
crystals without the presence of seeds. The aim is to 
end up with a single seed crystal that will act to 

15 nucleate crystal growth in the drop, 

[0075] It would be readily apparent to one of skill in 
the art following the teachings of the specification to 
vary the crystallization conditions disclosed herein to 
identify other crystallization conditions that would 

20 produce crystals of Sykcat protein, Sykcat protein complex 
or a homologue thereof. Such variations include, but are 
not limited to, adjusting pH, protein concentration 
and/or crystallization ten^erature, changing the identity 
or concentration of salt and/or precipitant used, using a 

25 different method for crystallization, or introducing 
additives such as detergents (e.g., TWEEN 20 
(monolaurate) , LDAO, Brij 30 (4 lauryl ether)), sugars 
(e.g., glucose, maltose), organic compounds (e.g., 
dioxane, dimethyl formamide) , lanthanide ions, or poly- 

30 ionic compounds that aid in crystallizations. High 

throughput crystallization assays may also be used to 
assist in finding or optimizing the crystallization 
condition. 
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Binding Pockets of the Sykcat Protein^ Protein Complexes 
or Homologues thereof 

[0076] In order to use the structure coordinates 
5 generated for the catalytic domain of Syk, its complexes, 
one of its binding pockets, or a Sykcat-like binding 
pocket thereof, it is often times necessary to convert 
the coordinates into a three-dimensional shape. This is 
be achieved through the use of a computer and 

10 commercially available software that is capable of 

generating three-dimensional graphical representations or 
three-dimensional information of molecules or portions 
thereof from a set of structure coordinates. 
[0077] Binding pockets, also referred to as binding 

15 sites in the present invention, are of significant 
utility in fields such as drug discovery. The 
association of natural ligands or substrates with the 
binding pockets of their corresponding receptors or 
enzymes is the basis of many biological mechanisms of 

20 action. Similarly, many drugs exert their biological 

effects through association with the binding pockets of 
receptors and enzymes. Such associations may occur with 
all or part of the binding pocket. An under steinding of 
such associations will help lead to the design of drugs 

25 having more favorable associations with their target 
receptor or enzyme, and thus, improved biological 
effects. Therefore, this information is valuable in 
designing potential inhibitors of the binding pockets of 
biologically important targets. The ATP and substrate 

30 binding pockets of this invention will be important for 
drug design. 

[0078] In one embodiment, the ATP-binding pocket 
comprises amino acids, L377, M424, V433, M448, A451, 
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L453, G454, L501, and S511 according to the structure of 
the Sykcat complexes in Figure 1 or 2 , In another 
embodiment, the ATP-binding pocket comprises amino acid 
residues L377, G378, S379, V385, A400, K402, V433, M448, 
5 E449, M450, A451, E452, P455, R498, N499, L501, S511 and 
D512 according to the structures of the Sykcat complexes 
in Figure 1 or 2 . 

[0079] In another embodiment, the ATP-binding pocket 
comprises amino acid residues L377, G378, S379, F382, 

10 V385, A400, K402, E420, V433, M448, E449, M450, A451, 

E452, L453, G454, P455, N457, R498, N499, L501, S511 and 
D512 according to the structure of the Sykcat- 
staurosporine complex in Figure 1. In another 
embodiment, the ATP-binding pocket comprises amino acid 

15 residues L377, G378, S379, V385, A400, K402, V433, M448, 
E449, M450, A451, E452, P455, K458, R498, N499, L501, 
S511 and D512 according to the structure of the Sykcat- 
PT426-AMP-PNP complex in Figure 2. These amino acid 
residues are within 5 A (^'5 A sphere amino acids") of 

20 staurosporine or AMP-PNP bound in the ATP-binding 

pockets, as identified using the program QUANTA (Accelrys 
©2001,2002) . 

[0080] In another embodiment, the ATP-binding pocket 
comprises amino acid residues K375, E376, L377, G378, 

25 S379, G380, N381, F382, G383, T384, V385, K386, K387, 
T398, V399, A400, V401, K402, E420, M424, V433, R434, 
L446, V447, M448, E449, M450, A451, E452, L453, G454, 
P455, L456, N457, K458, D494, A496, A497, R498, N499, 
V500, L501, L502, V503, K509, 1510, S511, D512, F513, and 

30 G514 according to the structure of the Sykcat- 
staurosporine complex in Figure 1. In another 
embodiment, the ATP-binding pocket comprises amino acid 
residues D376, L377, G378, 8379, G380, G383, T384, V385, 
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K386, T398, V399, A400, V401, K402, L417, E420, M424, 
V433, R434, M435, L446, V447, M448, E449, M450, A451, 
E452, L453, G454, P455, L456, N457, K458, D494, A497, 
R498, N499. V500, L501, L502, V503, K509, 1510, S511, 
5 D512, F513, G514, and L515 according to the structure of 
Sykcat-PT426-AMP-PNP complex in Figure 2. These amino 
acid residues are within 8 A (''8 A sphere amino acids") 
of staurosporine or AMP-PNP bound in the ATP-binding 
pockets, as identified using the program QUANTA (Accelrys 

10 ©2001,2002) . 

[0081] In another embodiment, the invention provides a 
crystalline molecule or molecular complex comprising all 
or part of a Sykcat substrate binding pocket defined by 
structure coordinates of a set of amino acid residues 

15 which are identical to Syk amino acid residues Asp494, 

Gly532, Lys533, Trp534 and Pro535 according to Figure 2. 
These Sykcat residues form hydrogen bonds with the peptide 
PT425. 

[0082] It will be readily apparent to those of skill 
20 in the art that the numbering of amino acid residues in 
other homologues of Sykcat niay be different than that set 
forth for Sykcat- Corresponding amino acid residues in 
homologues of Sykcat inay be identified by visual 
inspection of the amino acid sequences or by using 
25 commercially available sec[uence homology, structural 

homology or structure superimposition software programs. 
[0083] Those having skill in the art will understand 
that a set of structure coordinates for a molecule or a 
molecular-complex or a portion thereof, is a relative set 
30 of points that define a shape in three dimensions. Thus, 
it is possible that an entirely different set of 
coordinates could define a similar or identical shape. 
Moreover, slight variations in the individual coordinates 
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will have little effect on overall shape. In terms of 
binding pockets, these variations would not be expected 
to significantly alter the nature of ligands that could 
associate with those pockets. 
5 [0084] The variations in coordinates discussed above 
may be generated as a result of mathematical 
manipulations of the Sykcat structure coordinates. For 
example, the structure coordinates set forth in Figure 1 
or 2 may undergo crystallographic permutations of the 
10 structure coordinates, f ractionalization of the structure 
coordinates, integer additions or subtractions, inversion 
or any coiribination of the above. 

[0085] Alternatively, modifications in the crystal 
structure due to mutations, additions, substitutions, 

15 and/or deletions of amino acids, or other changes in any 
of the components that make up the crystal may also 
account for variations in structure coordinates. If such 
variations are within a certain RMSD as compared to the 
original coordinates, the resulting three-dimensional 

20 shape is considered encompassed by this invention. Thus, 
for example, a ligand that binds to the binding pocket of 
Sykcat would also be expected to bind to another binding 
pocket whose structure coordinates define a shape that 
falls within the acceptable RMSD. 

25 [0086] Various computational analyses may be used to 
determine whether a binding pocket, motif, domain or 
portion thereof of a molecule or molecular complex is 
sufficiently similar to the binding pocket, motif, domain 
or portion thereof of Sykcat- Such analyses may be 

30 carried out using well-known software applications, such 
as ProFit (A. C.R. Martin, SciTech Software, ProFit 
version 1.8, University College London, 
http://www.bioinf.org.uk/software), Swiss-Pdb Viewer 
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(Guex et al.. Electrophoresis, 18, pp. 2714-2723 (1997)), 
the Molecular Similarity application of QUANTA (Accelrys 
©2001,2002) and as described in the accompanying User's 
Guide, all of which are incorporated herein by reference. 
5 [0087] The above- identified programs, as well as 
others known to those of skill in the art, permit 
comparisons between different structures, different 
conformations of the same structure, and different parts 
of the same structure. The procedure used by QUANTA 

10 (Accelrys ©2001,2002) and Swiss-Pdb Viewer to compare 
structures requires four steps: 1) loading the 
structures to be compared; 2) defining the atom 
equivalences in these structures; 3) performing a fitting 
operation on the structures; and 4) analyzing the 

15 results. 

[0088] The procedure used in ProFit to compare 
structures includes: 1) loading the structures to be 
coiwpared; 2) specifying selected residues of interest; 3) 
defining the atom equivalences in the selected residues; 
20 4) performing a fitting operation on the selected 
residues; and 5) analyzing the results. 

[0089] Each structure in the comparison is identified 
by a name. One structure is identified as the target 
(i.e., the fixed structure); all other structures are 

25 loaded in as working structures (i.e., moving 

structures) . Since atom equivalency within the above 
programs is defined by user input, for the purpose of 
this invention we will define eq[uivalent atoms as protein 
backbone atoms (N, Ca, C and O) for Sykcat amino acid 

30 residues and corresponding amino acid residues in the 
structures being compared. 

[0090] The corresponding amino acid residues may be 
identified by sequence alignment programs such as the 
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"bestfit" program available from the Genetics Computer 
Group which uses the local homology algorithm described 
by Smith and Waterman in Advances in Applied Mathematics 
2, 482 (1981), which is incorporated herein by reference. 
5 A suitable amino acid sequence alignment will require 

that the proteins being aligned share minimum percentage 
of identical amino acids. Generally, a first protein 
being aligned with a second protein should share in 
excess of about 35% identical amino acid residues (Hanks 

10 et al.. Science, 241, 42 (1988); Hanks and Quinn, Methods 
in Enzymology, 200, 38 (1991)). The identification of 
equivalent residues can also be assisted by secondary 
structure alignment, for exemrple, aligning the a-helices, 
p-sheets in the structure. The program Swiss-Pdb Viewer 

15 has its own best fit algorithm that is based on secondary 
sequence alignment. 

[0091] When a rigid fitting method is used, the 
working structure is translated and rotated to obtain an 
optimum fit with the target structure. The fitting 

20 operation uses an algorithm that computes the optimum 
translation and rotation to be applied to the moving 
structure, such that the. root mean square difference of 
the fit over the specified pairs of equivalent atom is an 
absolute minimum. This nxjmber, given in angstroms, is 

25 reported by the above programs. The Swiss-Pdb Viewer 
program sets an RMSD cutoff for eliminating pairs of 
equivalent atoms that have high RMSD values. An RMSD 
cutoff value can be used to exclude pairs of equivalent 
atoms with extreme individual RMSD values. In the 

30 program ProFit, the RMSD cutoff value can be specified by 
the user. 

[0092] For the purpose of this invention, any 
molecule, molecular complex, binding pocket, motif. 
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domain thereof or portion thereof that is within an RMSD 
for backbone atoms (N, Ca, C, O) when superimposed on the 
relevant backbone atoms described by structure 
coordinates listed in Figure 1 or 2 are encompassed by 
5 this invention. In one embodiment, the amino acid 

residues that define a binding pocket of Syk protein are 
identical to the amino acid residues that define the 
binding pocket of Zap-70 protein. 

[0093] One embodiment of this invention provides a 
10 crystalline molecule or molecular complex comprising a 

domain defined by structure coordinates of a set of amino 
acid residues that are identical to Syk amino acid 
residues according to Figure 1 or 2, wherein the RMSD of 
the backbone atoms between said set of amino acid 
15 residues and said Syk amino acid residues is not more 

than about 5.0 A. In other embodiments, the RMSD between 
said set and amino acid residues of said Syk amino acid 
residues is not greater than about 4.0 A, not greater 
than about 3.0 A, not greater than about 2.0 A, not 
20 greater than about 1.5 A, not greater than about 1.0 A or 
not greater than about 0.5 A. 

[0094] Another embodiment of this invention provides a 
crystalline molecule or molecular complex comprising 
substantially all of a domain defined by structure 

25 coordinates of a set of amino acid residues that are 

identical to Syk amino acid residues according to Figure 
1 or 2, wherein the RMSD of the backbone atoms between 
said set of amino acid residues of said molecule or 
molecular complex and said Syk amino acid residues is not 

30 more than about 5.0 A. In other embodiments, the RMSD 

between said set of amino acid residues of said molecule 
or molecular complex and said Syk amino acid residues is 
not greater than about 4.0 A, not greater than about 3 . 0 
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A, not greater than about 2.0 A, not greater than about 
1.5 A, not greater than about 1.0 A or not greater than 
about 0.5 A. 

[0095] Another embodiment of this invention provides a 
5 crystalline molecule or molecular complex comprising all 
or part of a Sykcat ATP-binding pocket defined by a set of 
amino acid residues comprising at least four amino acid 
residues which are identical to Syk amino acid residues 
L377, M424, V433, M448, A451, G454, L501, and S511 

10 according to Figure 1 or 2, wherein the root mean square 
deviation of the backbone atoms between said at least 
four amino acid residues and said Syk amino acid residues 
which are identical is not greater than about 3.0 A. In 
another embodiment, the RMSD between said set of cimino 

15 acid residues of said molecule or molecular complex and 
said Syk amino acid residues is not greater than about 
2.0 A, l.OAorO.SA. In a further embodiment, the 
binding pocket is defined by at least six amino acid 
residues or all of the above Syk amino acid residues. 

20 [0096] Another embodiment of this invention provides a 
crystalline molecule or molecular complex comprising all 
or part of a Sykcat ATP-binding pocket defined by a set of 
amino acid residues comprising at least four amino acid 
residues which are identical to Syk amino acid residues 

25 L377, F382, M424, V433, M448, A451, G454, L501, and S511 
according to Figure 1 or 2, wherein the root mean square 
deviation of the backbone atoms between said at least 
four amino acid residues and said Syk amino acid residues 
which are identical is not greater than about 3.0 A. In 

30 another embodiment, the RMSD between said set of eunino 
acid residues of said molecule or molecular complex and 
said Syk amino acid residues is not greater than about 
2,0 A, 1.0 A or 0.5 A. In a further embodiment, the 
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binding pocket is defined by at least six, eight or all 
of the above Syk amino acid residues. 

[0097] Another embodiment of this invention provides a 
crystalline molecule or molecular complex comprising all 
5 or part of a Sykcat ATP-binding pocket defined by a set of 
amino acid residues comprising at least four amino acid 
residues which are identical to Syk amino acid residues 
L377, F382, M424, V433, M448, A451, L453, G454, L501, and 
S511 according to Figure 1 or 2 , wherein the root mean 

10 sqpjare deviation of the backbone atoms between said at 
least four amino acid residues and said Syk amino acid 
residues which are identical is not greater than about 3 
A. In another embodiment, the RMSD between said set of 
amino acid residues of said molecule or molecular complex 

15 and said Syk amino acid residues is not greater than 
about 2.0 A, 1.0 A or 0.5 A. In a further embodiment, 
the binding pocket is defined by at least six, eight or 
all of the above Syk amino acid residues. 

[0098] Another embodiment of this invention provides a 
20 crystalline molecule or molecular complex comprising all 
or part of a Sykcat ATP-binding pocket defined by a set of 
amino acid residues comprising at least four amino acid 
residues which are identical to Syk amino acid residues 
L377, G378, S379, V385, A400, K402, V433, M448, E449, 
25 M450, A451, E452, P455, R498, N499, L501, S511 and D512 
according to Figure 1 or 2, wherein the root mean square 
deviation of the backbone atoms between said at least 
four amino acid residues and said Syk amino acid residues 
which are identical is not greater than about 3 A. In 
30 another embodiment, the RMSD between said set of amino 
acid residues of said molecule or molecular complex and 
said Syk amino acid residues is not greater than about 
2.0 A, 1.0 A or 0.5 A. In a further embodiment, the 



wo 2004/035604 



PCT/US2003/032812 



35 

binding pocket is defined by at least six, eight, ten, 
twelve, fourteen, sixteen or all of the above Syk amino 
acid residues . 

[0099] Another embodiment of this invention provides a 
5 crystalline molecule or molecular complex comprising all 
or part of a Sykcat ATP-binding pocket defined by a set of 
amino acid residues comprising at least four amino acid 
residues which are identical to Syk amino acid residues 
L377, G378, S379, F382, V385, A400, K402, E420, V433, 

10 M448, E449, M450, A451, E452, L453, G454, P455, N457, 
R498, N499, L501, S511 and D512 according to Figure 1, 
wherein the root mean sq[uare deviation of the backbone 
atoms between said at least four amino acid residues and 
said Syk amino acid residues which are identical is not 

15 greater than about 3 A. In another embodiment, the RMSD 
between said set of amino acid residues of said molecule 
or molecular complex and said Syk amino acid residues is 
not*^ greater than about 2.0 A, l.OAorO.SA, In a 
further embodiment, the binding pocket is defined by at 

20 least six, eight, ten, twelve, fourteen, sixteen, 

eighteen, twenty or all of the above Syk amino acid 
residues . 

[0100] Another embodiment of this invention provides a 
crystalline molecule or molecular complex comprising all 

25 or part of a Sykcat ATP-binding pocket defined by a set of 
amino acid residues comprising at least four amino acid 
residues which are identical to Syk amino acid residues 
L377, G378, S379, V385, A400, K402, V433, M448, E449, 
M450, A451, E452, P455, K458, R498, N499, L501, S511 and 

30 D512 according to Figure 2, wherein the root mean square 
deviation of the backbone atoms between said at least 
four amino acid residues and said Syk amino acid residues 
which are identical is not greater than about 3 A. In 
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another embodiment, the RMSD between said set of amino 
acid residues of said molecule or molecular complex and 
said Syk amino acid residues is not greater than about 
2.0 A, 1.0 A or 0.5 A. In a further embodiment, the 
5 binding pocket is defined by at least six, eight, ten, 
twelve, fourteen, sixteen, or all of the above Syk amino 
acid residues. 

[0101] Another embodiment of this invention provides a 
crystalline molecule or molecular complex comprising all 

10 or part of a Sykcat ATP-binding pocket defined by a set of 
amino acid residues comprising at least four amino acid 
residues which are identical to Syk amino acid residues 
K375, E376, L377, G378, S379, G380, N381, F382, G383, 
T384, V385, K386, K387, T398, V399, A400, V401, K402, 

15 E420, M424, V433, R434, L446, V447, M448, E449, M450, 
A451, E452, L453, G454, P455, L456, N457, K458, D494, 
A496, A497, R498, N499, V500, L501, L502, V503, K509, 
1510, S511, D512, F513, and G514 according to Figure 1, 
wherein the root mean square deviation of the backbone 

20 atoms between said at least four amino acid residues and 
said Syk amino acid residues which are identical is not 
greater than about 3 A. In another embodiment, the RMSD 
between said set of amino acid residues of said molecule 
or molecular complex and said Syk amino acid residues is 

25 not greater than about 2,0 A, 1.0 A or 0.5 A. In a 

further embodiment, the binding pocket is defined by at 
least six, eight, ten, twelve, fourteen, sixteen, 
eighteen, twenty, twenty-five, thirty, thirty-five, 
forty, forty-five or all of the above Syk amino acid 

30 residues. 

[0102] Another embodiment of this invention provides a 
crystalline molecule or molecular complex comprising all 
or part of a Sykcat ATP-binding pocket defined a set of 
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amino acid residues comprising at least four amino acid 
residues which are identical to Syk amino acid residues 
D376, L377, G378, S379, G380, G383, T384, V385, K386, 
T398, V399, A400. V401, K402, L417, E420, M424, V433. 
5 R434, M435, L446, V447, M448, E449, M450, A451, E452, 
L453, G454, P455, L456, N457, K458, D494, A497, R498, 
N499, V500, L501, L502, V503, K509, 1510, S511, D512, 
F513, G514, and L515 according to Figure 2, wherein the 
root mean square deviation of the backbone atoms between 

10 said at least four amino acid residues and said Syk amino 
acid residues which are identical is not greater than 
about 3 A. In another embodiment, the RMSD between said 
set of cunino acid residues of said molecule or molecular 
complex and said Syk amino acid residues is not greater 

15 than about 2.0 A, 1.0 A or 0.5 A. In a further 

embodiment, the binding pocket is defined by at least 
six, eight, ten, twelve, fourteen, sixteen, eighteen, 
twenty, twenty-five, thirty, thirty-five, forty, forty- 
five or all of the above Syk amino acid residues. 

20 [0103] One embodiment of this invention provides a 

crystalline molecule or molecular complex comprising all 
or part of a Sykcat substrate binding pocket defined by a 
set of amino acid residues comprising at least four amino 
acid residues which are identical to Syk amino acid 

25 residues Asp494, Gly532, Lys533, Trp534 and Pro535 
according to Figure 2, wherein the root niean square 
deviation of the backbone atoms between said at least 
four amino acid residues and said Syk amino acid residues 
which are identical is not greater than about 3 A. In 

30 another eihbodiment , the RMSD of the backbone atoms 

between said set of amino acid residues of said molecule 
or molecular complex and said Syk amino acid residues is 
not more than 2 . 0 A, 1.0 A or 0.5 A. 
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[0104] In another embodiment, the crystalline molecule 
or molecular complex above is a Syk catalytic domain or a 
Syk catalytic domain homologue. 

5 Computer Systems 

[0105] According to another embodiment, this invention 
provides a machine-readable data storage medium, 
comprising a data storage material encoded with machine- 
readable data, wherein said data defines the above- 
ID mentioned molecules or molecular complexes. In one 

embodiment, the data defines the above-mentioned binding 
pockets by comprising the structure coordinates of said 
amino acid residues according to Figure 1 or 2 . To use 
the structure coordinates generated for Sykcat/ homologues 
15 thereof, or one of its binding pockets, it is at times 

necessary to convert them into a three-dimensional shape 
or to extract three-dimensional structural information 
from them. This is achieved through the use of 
commercially or publicly available software that is 
20 capable of generating a three-dimensional structure or a 
three-dimensional representation of molecules or portions 
thereof from a set of structure coordinates- In one 
embodiment, the three-dimensional structure or 
representation may be displayed graphically. 
25 [0106] Therefore, according to another embodiment, 

this invention provides a machine-readable data storage 
medixim comprising a data storage material encoded with 
machine readable data. In one embodiment, a machine 
programmed with instructions for using said data, is 
30 capable of generating a three-dimensional structure or 
three-dimensional representation of any of the molecule 
or molecular coitplexes, or binding pockets thereof, that 
are described herein. 



wo 2004/035604 



PCT/US2003/032812 



39 

[0107] This invention also provides a con^uter 
comprising: 

a) a machine-readable data storage 
medium, comprising a data storage material encoded with 

5 machine-readable data, wherein said data defines any one 
of the above molecules or molecular complexes; 

b) a working memory for storing 
instructions for processing said machine-readable data; 

c) a central processing unit (CPU) 
10 coupled to said working memory and to said machine- 
readable data storage medium for processing said machine 
readable data and a means for generating three- 
dimensional staructural information of said molecule or 
molecule complex; and 

15 d) output hardware coupled to said 

central processing unit for outputting three-dimensional 
structural information of said molecule or molecular 
complex, or information produced by using said three- 
dimensional structural information of said molecule or 

20 molecular complex. 

[0108] In one embodiment, the data defines the binding 
pocket or domain of the molecule or molecular complex. 
[0109] Three-dimensional data generation may be 
provided by an instruction or set of instructions such as 

25 a computer program or commands for generating a three- 
dimensional structure or graphical representation from 
structure coordinates, or by subtracting distances 
between atoms, calculating chemical energies for a Sykcat 
molecule or molecular complex or homologues thereof, or 

30 calculating or minimizing energies for an association of 
Sykcat molecule or molecular complex or homologues thereof 
to a chemical entity. The graphical representation can 
be generated or displayed by commercially available 
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software programs . Examples of software programs include 
but are not limited to QUANTA (Accelrys ©2001, 2002), O 
(Jones at al., Acta Crystallogr. A47, pp. 110-119 (1991)) 
and RIBBONS (Carson, J. AppI . Crystallogr , , 24, pp. 9589- 
5 961 (1991)), which are incorporated herein by reference. 
Certain software progrcims may imbue this representation 
with physico-chemical attributes which are known from the 
chemical composition of the molecule, such as residue 
charge, hydrophobicity , torsional and rotational degrees 

10 of freedom for the residue or segment, etc. Examples of 
software progrcuns for calculating chemical energies are 
described in the Rational Drug Design section. 
[0110] Information about said binding pocket or 
information produced by using said binding pocket can be 

15 outputted through a display terminal, touchscreens , 

facsimile machines, modems, CD-ROMS, printers, a CD or 
DVD recorder, ZIP*" or JAZ™ drives or disk drives. The 
information can be in graphical or alphanumeric form. 
[0111] In one embodiment, the computer is executing an 

20 instruction such as a computer program for three 

dimensional data generation. In another embodiment, the 
computer further comprises a commercially available 
software program to display the information as a 
graphical representation. Examples of software programs 

25 include but are not limited to QUANTA (Accelrys 

©2001,2002), O (Jones et al.. Acta Cryst., A47, pp. 110- 
119 (1991)) and RIBBONS (Carson, J. Appl. Crystallogr . , 
24, pp. 9589-961 (1991)), all of which are incorporated 
herein by reference. 

30 [0112] Figure 7 demonstrates one version of these 
embodiments. System (10) includes a computer (11) 
comprising a central processing unit ("CPU") (20), a 
working memory (22) which may be, e.g., RAM (random- 
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access memory) or "core" memory, mass storage memory (24) 
(such as one or more disk drives or CD-ROM drives) , one 
or more cathode-ray tube ("CRT") display terminals (26), 
one or more keyboards (28) , one or more input lines (30) , 
5 and one or more output lines (40), all of which are 

interconnected by a conventional bi-directional system 
bus (50) . 

[0113] Input hardware (35) , coupled to computer (11) 
by input lines (30), may be implemented in a variety of 
ways. Machine-readable data of this invention may be 
inputted via the use of a modem or modems (32) connected 
by a telephone line or dedicated data line (34) . 
Alternatively or additionally, the input hardware (35) 
may comprise CD-ROM drives or disk drives (24) . In 
conjxinction with display terminal (26) , keyboard (28) may 
also be used as an input device. 

[0114] Output hardware (46), coupled to computer (11) 
by output lines (40) , may similarly be implemented by 
conventional devices. By way of example, output hardware 
(46) may include CRT display terminal (26) for displaying 
a graphical representation of a binding pocket of this 
invention using a program such as QUANTA as described 
herein. Output hardware may also include a printer (42) , 
so that hard copy output may be produced, or a disk drive 
(24), to store system output for later use. Output 
hardware may also include a display terminal, 
touchscreens , facsimile machines, modems, CD-ROMS, 
printers, a CD or DVD recorder, ZIP™ or JAZ™ drives, disk 
drives, or other machiner readable data storage device. 
[0115] In operation, CPU (20) coordinates the use of 
the various input and output devices (35) , (46) , 
coordinates data accesses from mass storage (24) and 
accesses to and from working memory (22) , and determines 
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the secjuence of data processing steps . A number of 
programs may be used to process the machine -readable data 
of this invention. Such programs are discussed in 
reference to the computational methods of drug discovery 
5 as described herein. Specific references to components 
of the hardware system (10) are included as appropriate 
throughout the following description of the data storage 
medium . 

[0116] Figure 8 shows a cross section of a magnetic 

10 data storage medium (100) which can be encoded with a 

machine-readable data that can be carried out by a system 
such as system (10) of Figure 7. Medivim (100) can be a 
conventional floppy diskette or hard disk, having a 
suitable svibstrate (101), which may be conventional, and 

15 a suitable coating (102), which may be conventional, on 
one or both sides, containing magnetic domains (not 
visible) whose polarity or orientation can be altered 
magnetically. Mediiam (100) may also have an opening (not 
shown) for receiving the spindle of a disk drive or other 

20 data storage device (24) . 

[0117] The magnetic domains of coating (102) of medium 
(100) are polarized or oriented so as to encode in manner 
which may be conventional, machine readable data such as 
that described herein, for execution by a system such as 

25 system (10) of Figure 7. 

[0118] Figure 9 shows a cross section of an optically- 
readable data storage medium (110) which also can be 
encoded with such a machine- readable data, or set of 
instructions, which can be carried out by a system such 

30 as system (10) of Figure 7. Medium (110) can be a 

conventional compact disk read only memory (CD-ROM) or a 
rewritable medivim such as a magneto-optical disk which is 
optically readable and magneto -optically writable. 
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Medivun (100) preferably has a suitable substrate (111), 
which may be conventional, and a suitable coating (112) , 
which may be conventional, usually on one side of 
substrate (111) . 
5 [0119] In the case of CD-ROM, as is well known, the 
coating (112) is reflective and is impressed with a 
plurality of pits (113) to encode the machine- readable 
data. The arrangement of pits is read by reflecting 
laser light off the surface of the coating (112) . A 
10 protective coating (114) , which preferably is 

svibstantially transparent, is provided on top of the 
coating (112) . 

[0120] In the case of a magneto-optical disk, as is 
well known, the coating (112) has no pits (113), but has 

15 a plurality of magnetic domains whose polarity or 

orientation can be changed magnetically when heated above 
a certain temperature, as by a laser (not shown) . The 
orientation of the domains can be read by measuring the 
polarization of laser light reflected from the coating 

20 (112) . The arrangement of the domains encodes the data 
as described above. 

[0121] In one embodiment, the structure coordinates of 
said molecules or molecular complexes are produced by 
homology modeling of at least a portion of the structure 

25 coordinates of Figure 1 or 2 . Homology modeling can be 

used to generate structural models of Sykcat homologues or 
other homologous proteins based on the known structure of 
Sylccat- This can be achieved by performing one or more of 
the following steps: performing sequence alignment 

30 between the amino acid sequence of an unknown molecule 
against the amino acid sequence of Sykcat; identifying 
conserved and variable regions by sequence or structure; 
generating structure coordinates for structurally conserved 
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residues of the \inknown structure from those of Sykcat; 
generating conformations for the structurally variable 
residues in the xmknown structure; replacing the non- 
conserved residues of Sykcat with residues in the unknown 
5 structure; building side chain conformations; and refining 
and/or evaluating the unknown structure. 

[0122] Software programs that are useful in homology 
modeling include XALIGN (Wishart, D. S. et al . , Comput. 
Appl. Biosci., 10, pp. 687-88 (1994)) and CLUSTAL W 
10 Alignment Tool (Higgins D, G. et al.. Methods Enzymol., 
266, pp. 383-402 (1996)). See also, U.S. Patent No. 
5,884,230. These references are incorporated herein by 
reference. 

[0123] To perform the sequence alignment, programs such 

15 as the "bestfit" program available from the Genetics 
Computer Group (Waterman in Advances in Applied 
Mathematics 2, 482 (1981), which is incorporated herein 
by reference] and CLUSTAL W Alignment Tool (Higgins D. G. 
et al., Methods Enzymol., 266, pp. 383-402 (1996), which 

20 is incorporated by reference] can be used. To model the 

amino acid side chains of another protein, the amino acid 
residues in Sykcat can be replaced, using a computer 
graphics program such as "O" (Jones et al, (1991) Acta 
Cryst. Sect. A, 47: 110-119), by those of the homologous 

25 protein, where they differ. The same orientation or a 
different orientation of the amino acid can b6 used. 
Insertions and deletions of amino acid residues may be 
necessary where gaps occur in the sequence alignment. 
However, certain portions of the active site of Sykcat and 

30 its homologues may be highly conserved with essentially no 
insertions and deletions. 

[0124] Homology modeling can be performed using, for 
example, the computer programs SWISS-MODEL available 



wo 2004/035604 



PCT/US2003/032812 



45 

through Glaxo Wellcome Experimental Research in Geneva, 
Switzerland; WHATIF available on EMBL servers; Schnare et 
al., J. Mol. Biol., 256: 701-719 (1996); Blundell et al.. 
Nature 326: 347-352 (1987); Fetrow and Bryant, 
5 Bio/Technology 11:479-484 (1993); Greer, Methods in 

Enzymology 202: 239-252 (1991); and Johnson et al, Crit. 
Rev. Biochem. Mol. Biol. 29:1-68 (1994). An example of 
homology modeling can be found, for example, in Szklarz 
G-D., Life Sci. 61: 2507-2520 (1997). These references 

10 are incorporated herein by reference. 

[0125] Thus, in accordance with the present invention, 
data capable of generating the three dimensional 
structure or three-dimensional representation of the 
above molecules or molecular conqplexes, or binding 

15 pockets or domains thereof, can be stored in a machine- 
readable storage medium, which is capable of displaying 
structural information or a graphical three-dimensional 
representation of the structure. In one embodiment, the 
means of generating three-dimensional structural 

20 information is provided by means for generating a three- 
dimensional structural representation of the binding 
pocket or domain of a molecule or molecular complex. 

Rational Drug Design 

25 [0126] The Sykcat structure coordinates or the three- 
dimensional graphical representation generated from these 
coordinates may be used in conjunction with a computer 
for a variety of purposes, including drug discovery. 
[0127] For excunple, the structure encoded by the data 

30 may be computationally evaluated for its ability to 

associate with chemical entities. Chemical entities that 
associate with Sykcat niay inhibit or activate Syk or its 
homologues, and are potential drug candidates. 
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Alternatively, the structure encoded by the data may be 
displayed in a graphical three-dimensional representation 
on a computer screen. This allows visual inspection of 
the structure, as well as visual inspection of the 
5 structure's association with chemical entities. 

[0128] In one embodiment, the invention provides for a 
method of using a computer for selecting an orientation 
of a chemical entity that interacts favorably with a 
binding pocket or domain comprising the steps of: 
10 (a) providing the structure coordinates 

of the binding pocket or domain on a coinputer comprising 
the means for generating three-dimensional structural 
information from the structure coordinates; 

(b) employing computational means to dock 
15 a first chemical entity in the binding pocket or domain; 

(c) quantitating the interaction energy 
between the chemical entity and all or part of the 
binding pocket or domain for different orientations of 
the chemical entity; and 

20 (d) selecting the orientation of the 

chemical entity with the most favorable interaction 
energy . 

[0129] In one embodiment, the docking is facilitated 
by said quant itated interaction energy. 
25 [0130] In one embodiment, the above method further 
comprises the following steps before step (a) : 

(e) producing a crystal of a molecule or 
molecular complex comprising Sykcat or homologue thereof; 

(f) determining the three-dimensional 
30 structure coordinates of the molecule or molecular 

complex by X-ray diffraction of the crystal; and 

(g) identifying all or part of said 

binding pocket. 
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[0131] Three-dimensional structural information in 
step (a) may be generated by instructions such as a 
computer program or commands that can generate a three- 
dimensional representation; subtract distances between 
5 atoms; calculate chemical energies for a Sykcat molecule, 
molecular complex or homologues thereof; or calculate or 
minimize the chemical energies of an association of Syk 
molecule, molecular complex or homologues thereof to a 
chemical entity. These types of computer programs are 

10 knovnn in the art. The graphical representation can be 

generated or displayed by commercially available software 
programs. Examples of software programs include but are 
not limited to QUANTA (Accelrys ©2001, 2002), O (Jones et 
al.. Acta Crystallogr. A47, pp. 110-119 (1991)) and 

15 RIBBONS (Carson, J. Appl. Crystallogr, , 24, pp. 9589-961 
(1991)), which are incorporated herein by reference. 
Certain software programs may imbue this representation 
with physico-chemical attributes which are known from the 
chemical composition of the molecule, such as residue 

20 charge, hydrophobicity , torsional and rotational degrees 
of freedom for the residue or segment, etc. Examples of 
software programs for calculating chemical energies are 
described below. 

[0132] The above method may further comprise the 
25 following step after step (d) : outputting said 

quantified interaction energy to a suitable output 
hardware, such as a CRT display terminal, a CD or DVD 
recorder, ZIP™ or JAZ™ drive, a disk drive, or other 
machine-readable data storage device, as described 
30 previously. The method may further comprise generating a 
three-dimensional structure, graphical representation 
thereof, or both, of the molecule or molecular complex 
prior to step (b) . 
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[0133] One embodiment of this invention provides for 
the above method, wherein energy minimization with or 
without molecular dynamics simulations are performed 
simultaneously with or following step (b) . 
5 [0134] The above method may further comprise the steps 
of: 

(e) repeating steps (b) through (d> with 
a second chemical entity; and 

(f) selecting at least one of said first 
10 or second chemical entity that interacts more favorably 

with said binding pocket or domain based on said 
quantitated interaction energy of said first or second 
chemical entity. 

[0135] In one embodiment, the invention provides for a 
15 method of using a conputer for selecting an orientation 
of a chemical entity with a favorable shape 
complementarity in a binding pocket comprising the steps 
of: 

(a) providing the structure coordinates 
20 of said binding pocket and ligand bound therein on a 

computer comprising the means for generating three- 
dimensional structural information from said structure 
coordinates ; 

(b) employing computational means to dock 
25 a first chemical entity in the binding pocket; 

(c) quantitating the contact score of 
said chemical entity in different orientions; and 

(d) selecting an orientation with the 
highest contact score. 

30 [0136] In one embodiment, the docking is facilitated 
by the contact score. 

[0137] The method above may further coitprise the step 
of generating a three-dimensional graphical 
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representation of the binding pocket and ligand bound 
therein prior to step (b) . 

[0138] The method above may further comprise the steps 
of: 

5 (e) repeating steps (b) through (d) with 

a second chemical entity; and 

(f) selecting at least one of said first 
or second chemical entity that has a higher contact score 
based on said quauititated contact score of said first or 

10 second chemical entity. 

[0139] In another embodiment, the invention provides a 
method for screening a plurality of chemical entities to 
associate at a deformation energy of binding of less than 
-7 kcal/mol with said binding pocket: 

15 (a) employing computational means, which 

utilize said structure coordinates to dock one of said 
chemical entities from the plurality of chemical entities 
in said binding pocket; 

(b) quantifying the deformation energy of 
20 binding between the chemical entity and the binding 

pocket ; 

(c) repeating steps (a) and (b) for each 
remaining chemical entity; and 

(d) output ting a set of chemical entities 
25 that associate with the binding pocket at a deformation 

energy of binding of less than -7 kcal/mol to a suitable 
output hardware. 

[0140] In another embodiment, the method comprises the 
steps of: 

30 (a) constructing a computer model of a 

binding pocket of the molecule or molecular complex; 

(b) selecting a chemical entity to be 
evaluated by a method selected from the group consisting 
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of assembling said chemical entity; selecting a chemical 
entity from a small molecule database; de novo ligand 
design of said chemical entity; and modifying a known 
agonist or inhibitor, or a portion thereof, of a Syk 
5 protein or homologue thereof; 

(c) employing computational means to dock 
said chemical entity to be evaluated in said binding 
pocket in order to provide an energy-minimized 
configuration of said chemical entity in the binding 

10 pocket; and 

(d) evaluating the results of said docking 
to quantify the interaction energy between said chemical 
entity and the binding pocket. 

[0141] Alternatively, the structure coordinates of the 
15 Sykcat binding pockets may be utilized in a method for 
identifying a candidate inhibitor of a molecule 
comprising a binding pocket of Sykcat • This method 
comprises the steps of: 

(a) using a three-dimensional structure of the 
20 binding pocket or domain to design, select or optimize a 

plurality of chemical entities; 

(b) contacting each chemical entity with the 
molecule or the molecular complex; 

(c) monitoring the inhibition to the catalytic 
25 activity of the molecule or molecular complex by the 

chemical entity; and 

(d) selecting a chemical entity based on the 
inhibitory effect of the chemical entity on the catalytic 
activity of the molecule or molecular complex. 

30 [0142] In one embodiment, step (a) is performed using 
a three-dimensional structure of the binding pocket or 
domain or portion thereof of the molecule or molecular 
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complex. In another embodiment, the three-dimensional 
structure is displayed as a graphical representation. 
[0143] In another embodiment, the method comprises the 
steps of: 

5 (a) constructing a computer model of a 

binding pocket of the molecule or molecular complex; 

(b) selecting a chemical entity to be 
evaluated by a method selected from the group consisting 
of assembling said chemical entity; selecting a chemical 

10 entity from a small molecule database; de novo ligand 
design of said chemical entity; and modifying a known 
agonist or inhibitor, or a portion thereof, of a Syk 
protein or homologue thereof; 

(c) employing computational means to dock 
15 said chemical entity to be evaluated in said binding 

pocket in order to provide an energy-minimized 

conf igruration of said chemical entity in the binding 

pocket; 

(d) evaluating the results of said docking 
20 to quantify the interaction energy between said chemical 

entity and the binding pocket; 

(e) synthesizing said chemical entity; and 

(f) contacting said chemical entity with 
said molecule or molecular complex to determine the 

25 ability of said con5)Ound to activate or inhibit said 
molecule. 

[0144] In one embodiment, the invention provides a 
method of designing a con^jound or complex that interacts 
with all or part of the binding pocket comprising the 
30 steps of: 

(a) providing the structure coordinates 
of said binding pocket or domain on a computer comprising 
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the means for generating three-dimensional structural 
information from said structure coordinates; 

(b) using the computer to dock a first 
chemical entity in part of the binding pocket or domain; 
5 (c) docking at least a second chemical 

entity in another part of the binding pocket or domain; 

(d) quantifying the interaction energy 
between the first or second chemical entity and part of 
the binding pocket or domain; 

10 (e) repeating steps (b) to (d) with 

another first and second chemical entity, selecting a 
first and a second chemical entity based on said 
quantified interaction energy of of all of said first and 
second chemical entity; 

15 (f) optionally, visually inspecting the 

relationship of the first and second chemical entity to 
each other in relation to the binding pocket or domain on 
a computer screen using the three-dimensional graphical 
representation of the binding pocket or domain and said 

20 first and second chemical entity; and 

(g) assembling the first and second 
chemical entity into a compound or complex that interacts 
with said binding pocket or domain by model building. 
[0145] For the first time, the present invention 

25 permits the use of molecular design techniques to 

identify, select and design chemical entities, including 
inhibitory compounds, capable of binding to Sykcat or 
Sylccat-like binding pockets, motifs and domains. 
[0146] Applicants' elucidation of binding pockets on 

30 Sykcat provides the necessary information for designing 
new chemical entities and compounds that may interact 
with Sykcat substrate or ATP-binding pockets or Sykcat- like 
substrate or ATP-binding pockets, in whole or in part. 
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Due to the homology in the kinase core between Sykcat and 
ZAP-70, compounds that inhibit Sykcat are also expected to 
inhibit ZAP-70, especially those compounds that bind the 
ATP-binding pocket. Throughout this section, discussions 
5 about the ability of a chemical entity to bind to, 

associate with, or inhibit Sykcat binding pockets refer to 
features of the entity alone. 

[0147] The design of compoiinds that bind to or inhibit 
Sykcat binding pockets according to this invention 

10 generally involves consideration of two factors. First, 
the chemical entity must be capable of physically and 
structurally associating with parts or all of the Sykcat 
binding pockets . Non-covalent molecular interactions 
important in this association include hydrogen bonding, 

15 van der Waals interactions, hydrophobic interactions and 
electrostatic interactions . 

[0148] Second, the chemical entity must be able to 
assume a conformation that allows it to associate with 
the Sykcat binding pockets directly. Although certain 

20 portions of the chemical entity will not directly 

participate in these associations, those portions of the 
chemical entity may still influence the overall 
conformation of the molecule. This, in turn, may have a 
significant impact on potency. Such conformational 

25 requirements include the overall three-dimensional 
structure and orientation of the chemical entity in 
relation to all or a portion of the binding pocket, or 
the spacing between functional groups of a chemical 
entity coinprising several chemical entities that directly 

30 interact with the Sykcat or Sykcat-like binding pockets. 

[0149] The potential inhibitory or binding effect of a 
chemical entity on Sykcat binding pockets may be analyzed 
prior to its actual synthesis and testing by the use of 
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computer modeling techniques. If the theoretical 
structure of the given entity suggests insufficient 
interaction and association between it and the Sykcat 
binding pockets, testing of the entity is obviated. 
5 However, if computer modeling indicates a strong 

interaction, the molecule may then be synthesized and 
tested for its ability to bind to a Sykcat binding pocket. 
This may be achieved by testing the ability of the 
molecule to inhibit the catalytic domain of Syk by 

10 methods disclosed in International PCT Applications WO 

01/09134, WO 01/47922 and United States Patent 6,114,333, 
all of which are specifically incorporated herein by 
reference. In this manner, synthesis of inoperative 
compounds may be avoided. 

15 [0150] A potential inhibitor that binds to a binding 
pocket may be computationally evaluated by means of a 
series of steps in which chemical entities or fragments 
are screened and selected for their ability to associate 
with the above binding pockets. 

20 [0151] One skilled in the art may use one of several 
methods to screen chemical entities or fragments or 
moieties thereof for their ability to associate with the 
binding pockets described herein. This process may begin 
by visual inspection of, for example, any of the binding 

25 pockets on the computer screen based on the Syk structure 
coordinates in Figure 1 or 2 or other coordinates that 
define a similar shape generated from the machine- 
readable storage medixim. Selected chemical entities, or 
fragments or moieties thereof may then be positioned in a 

30 variety of orientations, or docked, within that binding 
pocket as defined supra. Docking may be accomplished 
using software such as QUANTA and Sybyl (Tripos 
Associates, St. Louis, MO), followed by, or performed 
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simultaneously with, energy minimization, rigid-body 
minimization (Gshwend, supra) and molecular dynamics with 
standard molecular mechanics £orce fields, such as CHARMM 
and AMBER. 

5 [0152] Specialized computer programs may also assist 
in the process of selecting fragments or chemical 
entities or fragments or moieties thereof. These 
include : 

1. GRID (P. J. Goodford, "A Computational 

10 Procedure for Determining Energetically Favorable Binding 
Sites on Biologically Important Macromolecules " , J". Med. 
Chem., 28, pp. 849-857 (1985) )• GRID is available from 
Oxford University, Oxford, UK. 

2. MCSS (A. Miranker et al., "Functionality Maps 
15 of Binding Sites: A Multiple Copy Simultaneous Search 

Method." Proteins: Structure, Function and Genetics, 
11, pp. 29-34 (1991)). MCSS is available from Molecular 
Simulations, San Diego, CA. 

3. AUTODOCK (D. S. Goodsell et al., "Automated 
20 Docking of Substrates to Proteins by Simulated 

Annealing", Proteins: Structure, Function, and Genetics, 
8, pp. 195-202 (1990)). AUTODOCK is available from 
Scripps Research Institute, La Jolla, CA, 

4. DOCK (I. D. Kuntz et al . , "A Geometric Approach 
25 to Macromolecule-Ligand Interactions", J. Mol . Biol,, 

161, pp. 269-288 (1982)). DOCK is available from 
University of California, San Francisco, CA. 
[0153] Once suitable chemical entities or fragments 
have been selected, they can be assembled into a single 
30 compound or complex. Assembly may be preceded by visual 
inspection of the relationship of the fragments to each 
other on the three-dimensional image displayed on a 
computer screen in relation to the structure coordinates 
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of Sykcat- This would be followed by manual model 
building using software such as QUANTA or Sybyl . 
[0154] Useful programs to aid one of skill in the art 
in connecting the individual chemical entities or 
5 fragments include: 

1. CAVEAT (P. A. Bartlett efc al . , "CAVEAT: A 
Program to Facilitate the Structure-Derived Design of 
Biologically Active Molecules", in "Molecular Recognition 
in Chemical and Biological Problems*' , Special Pub., Royal 

10 Chem. Soc, 78, pp. 182-196 (1989); G- Lauri and P. A. 

Bartlett, "CAVEAT: a Progreun to Facilitate the Design of 
Organic Molecules", J. Comput. Aided Mol. Des. , 8, pp. 
51-66 (1994)). CAVEAT is available from the University of 
California, Berkeley, CA. 

15 2 . 3D Database systems such as ISIS (MDL 

Information Systems, San Leandro, CA) . This area is 
reviewed in Y. C. Martin, "3D Database Searching in Drug 
Design", J. Med. Chem., 35, pp. 2145-2154 (1992). 

3. HOOK (M. B. Eisen et al . , "HOOK: A Program for 

2 0 Finding Novel Molecular Architectures that Satisfy the 
Chemical and Steric Requirements of a Macromolecule 
Binding Site", Proteins: Struct., Funct., Genet., 19, 
pp. 199-221 (1994) . HOOK is available from Molecular 
Simulations, San Diego, CA. 

25 [0155] Instead of proceeding to build an agonist or an 
inhibitor of any of the above binding pockets in a step- 
wise fashion, one fragment or chemical entity at a time 
as described above, inhibitory or other Syk binding 
compounds may be designed as a whole or "de novo" using 

30 either an empty binding pocket or optionally including 

some portion (s) of a known inhibitor (s) . There are many 
de novo ligand design methods including: 
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1. LUDI (H.-J. Bohm, "The Computer Program LUDI: 
A New Method for the De Novo Design of Enzyme 
Inhibitors", J". Comp. Aid. Molec . Design, S, pp. 61-78 
(1992)). LUDI is available from Molecular Simulations 

5 Incorporated, San Diego, CA. 

2. LEGEND (Y. Nishibata et al . , Tetrahedron, 47, 
p. 8985 (1991)). LEGEND is available from Molecular 
Simulations Incorporated, San Diego, CA. 

3. LeapFrog (available from Tripos Associates, St. 
10 Louis, MO) . 

4. SPROUT (V. Gillet et al., "SPROUT: A Program 
for Structure Generation)", J". Comput. Aided Mol. Design, 
7, pp. 127-153 (1993)). SPROUT is available from the 
University of Leeds, UK. 

15 [0156] Other molecular modeling techniques may also be 
einployed in accordance with this invention (see, e.g., N. 
C. Cohen et al., "Molecular Modeling Software and Methods 
for Medicinal Chemistry, J. Med. Chem. , 33, pp. 883-894 
(1990) ; see also, M. A. Navia and M. A. Murcko, "The Use 

20 of Structural Information in Drug Design", Current 

Opinions in Structural Biology, 2, pp. 202-210 (1992); L. 
M. Balbes et al . , "A Perspective of Modern Methods in 
Computer-Aided Drug Design", in Reviews in Computational 
Chemistry, Vol. 5, K. B. Lipkowitz and D. B. Boyd, Eds., 

25 VCH, New York, pp. 337-380 (1994); see also, W. C. Guida, 
"Software For Structure-Based Drug Design", Curr. Opin. 
Struct. Biology, 4, pp. 777-781 (1994)). 
[0157] Once a chemical entity has been designed or 
selected by the above methods, the efficiency with which 

30 that entity may bind to any of the above binding pockets 
may be tested and optimized by computational evaluation. 
For example, an effective binding pocket inhibitor must 
preferably demonstrate a relatively small difference in 
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energy between its bound and free states (i,e., a small 
deformation energy of binding) . Thus, the most efficient 
binding pocket inhibitors should preferably be designed 
with a deformation energy of binding of not greater than 
5 about 10 kcal/mole, more preferably, not greater than 7 
kcal/mole. Binding pocket inhibitors may interact with 
the binding pocket in more than one conformation that is 
similar in overall binding energy. In those cases, the 
deformation energy of binding is taken to be the 
10 difference between the energy of the free entity and the 
average energy of the conformations observed when the 
inhibitor binds to the protein. 

[0158] A chemical entity designed or selected as 
binding to any one of the above binding pockets may be 

15 further computationally optimized so that in its bound 
state it would preferably lack repulsive electrostatic 
interaction with the target enzyme and with the 
surrounding water molecules. Such non- complementary 
electrostatic interactions include repulsive charge- 

20 charge, dipole-dipole and charge-dipole interactions. 

[0159] Specific computer software is available in the 
art to evaluate compound deformation energy and 
electrostatic interactions. Examples of programs 
designed for such uses include: Gaussian 94, revision C 

25 (M. J. Frisch, Gaussian, Inc., Pittsburgh, PA ©1995) ; 
AMBER, version 4.1 (P. A. Kollman, University of 
California at San Francisco, ©1995) ; QUANTA/CHARMM 
(Molecular Simulations, Inc., San Diego, CA ©1998); 
Insight II /Discover (Molecular Simulations, Inc., San 

30 Diego, CA ©1998) ; DelPhi (Molecular Simulations, Inc., 
San Diego, CA ©1998); and AMSOL (Quantxim Chemistry 
Program Exchange, Indiana University) . These programs 
may be implemented, for instance, using a Silicon 
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Graphics workstation such as an Indigo2 with "IMPACT" 
graphics. Other hardware systems and software packages 
will be known to those skilled in the art. 
[0160] Another approach enabled by this invention, is 
5 the computational screening of small molecule databases 
for chemical entities or compoxinds that can bind in 
whole, or in part, to any of the above binding pockets. 
In this screening, the quality of fit of such entities to 
the binding pocket may be judged either by shape 

10 complementarity or by estimated interaction energy (E. C. 
Meng et al., J. Comp. Chem. , 13, pp. 505-524 (1992)). 
[0161] Another particularly useful drug design 
technique enabled by this invention is iterative drug 
design. Iterative drug design is a method for optimizing 

15 associations between a protein and a compound by 
determining and evaluating the three-dimensional 
structures of successive sets of protein/ compound 
complexes . 

[0162] According to another embodiment, the invention 
20 provides chemical entities which associate with a Sykcat 
binding pocket produced or identified by the method set 
forth above. 

[0163] Another particularly useful drug design 
technique enabled by this invention is iterative drug 
25 design. Iterative drug design is a method for optimizing 
associations between a protein and a chemical entity by 
determining and evaluating the three-dimensional 
structures of successive sets of protein/chemical entity 
complexes . 

30 [0164] In iterative drug design, crystals of a series 
of protein or protein complexes are obtained and then the 
three-dimensional structures of each crystal is solved. 
Such an approach provides insight into the association 
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between the proteins and compounds of each complex. This 
is accomplished by selecting compounds with inhibitory 
activity, obtaining crystals of this new protein/ compound 
complex, solving the three-dimensional structure of the 
5 complex, and comparing the associations between the new 
protein/ compound complex and previously solved 
protein/compoiind complexes. By observing how changes in 
the coittpoiind affected the protein/compound associations, 
these associations may be optimized. 

10 [0165] In some cases, iterative drug design is carried 
out by forming successive protein- compoxind complexes and 
then crystallizing each new coir^slex. High throughput 
crystallization assays may be used to find a new 
crystallization condition or to optimize the original 

15 protein or complex crystallization condition for the new 
contplex. Alternatively, a pre- formed protein crystal may 
be soaked in the presence of an inhibitor, thereby 
forming a protein /compound complex and obviating the need 
to crystallize each individual protein /compound complex. 

20 

Struclmre Determination of Other Moleculea 

[0166] The structure coordinates set forth in Figure 1 
or 2 can also be used to aid in obtaining structural 
information about other crystallized molecules or 
25 molecular complexes. This may be achieved by any of a 
number of well-known techniques, including molecular 
replacement . 

[0167] In one embodiment, the structure coordinates of 
said molecules or molecular coinplexes are produced by 
30 homology modeling of the coordinates of Figure 1 or 2 . 
Homology modeling can be used to generate structural 
models of Sylccat homologues or other homologous proteins 
based on the known structure of Sykcat* This can be 
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achieved by performing one or more of the following steps: 
performing sequence alignment between the amino acid 
sequence of an unknown molecule against the amino acid of 
Syk; identifying conserved and variable regions by sequence 
5 or structure; generating structure coordinates for 

structurally conserved residues of the unknown structure 
from those of Syk; generating conformations for the 
• structurally variable residues in the \inknown structure; 
replacing the non-conserved residues of Syk with residues 
10 in the unlaiown structure; building side chain 

conformations; and refining and/or evaluating the unknown 
structure. 

[0168] For example, since the protein sequence of the 
catalytic domains of Syk and ZAP-70 can be aligned 

15 relative to each other, it is possible to construct 

models of the structures of ZAP-70, particularly in the 
regions of the active site, using the Sykcat structure. 
Software programs that are useful in homology modeling 
include XALIGN (Wishart, D. S. et al . , Comput. Appl. 

20 Biosci., 10, pp. 687-88 (1994)) and CLUSTAL W Alignment 

Tool (Higgins D. G. et al.. Methods Enzymol, 266, pp. 383- 
402 (1996)). See also, U.S. Patent No. 5,884,230. These 
references are incorporated herein by reference. 
[0169] To perform the sequence alignment, programs such 

25 as the "bestfit" program available from the Genetics 
Computer Group (Waterman in Advances in Applied 
Mathematics 2, 482 (1981), which is incorporated herein 
by reference) and CLUSTAL W Alignment Tool (Higgins D. G. 
et al.. Methods Enzymol, 266, pp. 383-402 (1996), which is 

30 incorporated by reference) can be used. To model the 

amino acid side chains of ZAP-70, the amino acid residues 
in Syk can be replaced, using a computer graphics program 
such as "O" (Jones et al, (1991) Acta Cryst. Sect. A, 47: 
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110-119), by those of the homologous protein, where they 
differ. The Scune orientation or a different orientation 
of the amino acid can be used. Insertions and deletions 
of amino acid residues may be necessary where gaps occur 
5 in the sequence alignment. However, certain portions of 
the active site of Syk and its homologues are highly 
conserved with essentially no insertions and deletions. 
[0170] Homology modeling can be performed using, for 
example, the computer programs SWISS-MODEL available 
10 through Glaxo Wellcome Experimental Research in Geneva, 

Switzerland; WHATIF available on EMBL servers; Schnare et 
al. (1996) J. Mol. Biol, 256: 701-719; Blundell et al. 
(1987) Nature 326: 347-352; Fetrow and Bryant (1993) 
Bio /Technology 11:479-484; Greer (1991) Methods in 
15 Enzymology 202: 239-252; and Johnson et al (1994) Crit- 
Rev. Biochem. Mol Biol, 29:1-68. An example of homology 
modeling can be foxind, for example, in Szklarz G.D (1997) 
Life Sci. 61: 2507-2520- These references are 
incorporated herein by reference. 
20 [0171] According to an alternate embodiment, the 

machine-readable data storage medium comprises a data 
storage material encoded with a first set of machine 
readable data that comprises the Fourier transform of at 
least a portion of the structure coordinates set forth in 
25 Figure 1 or 2 or homology model thereof, and which, when 
using a machine programmed with instructions for using 
said data, can be combined with a second set of machine 
readable data comprising the X-ray diffraction pattern of 
a molecule or molecular complex to determine at least a 
30 portion of the structure coordinates corresponding to the 
second set of machine readable data. 

[0172] In another embodiment, the invention provides a 
computer for determining at least a portion of the 
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structure coordinates corresponding to X-ray diffraction 
data obtained from a molecule or molecular complex having 
an \inknown structure, wherein said computer comprises: 

(a) a machine-readable data storage 

5 medium comprising a data storage material encoded with 
machine-readable data, wherein said data comprises at 
least a portion of the structure coordinates of Sykcat 
according to Figure 1 or 2 or homology model thereof; 

(b) a machine-readable data storage 

10 medium comprising a data storage material encoded with 

machine-readable data, wherein said data comprises X-ray 
diffraction data obtained from said molecule or molecular 
complex having an unlcnown structure; and 

(c) instructions for performing a Fourier 
15 transform of the machine-readable data of (a) and for 

processing said machine-readable data of (b) into 
structure coordinates. 

[0173] For exanple, the Fourier transform of at least 
a portion of the structure coordinates set forth in 

20 Figure 1 or 2 or homology model thereof may be used to 

determine at least a portion of the structure coordinates 
of Syk protein, Sykcat protein homologues, or proteins 
sufficiently homologous to Sykcat- In one embodiment, the 
molecule is a Sykcat homologue. In another embodiment, 

25 the molecular complex is selected from the group 

consisting of Sykcat complex and Sykcat homologue complex, 
[0174] Therefore, in another embodiment this invention 
provides a method of utilizing molecular replacement to 
obtain structural information about a molecule or a 

30 molecular complex of unknown structure wherein the 

molecule or molecular complex is sufficiently homologous 
to Sykcat/ comprising the steps of: 
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(a) crystallizing said molecule or 
molecular complex of unknown structure; 

(b) generating a X-ray diffraction 
pattern from said crystallized molecule or molecular 

5 complex; 

(c) applying at least a portion of the 
Sykcat structure coordinates set forth in one of Figure 1 
or 2 or in a homology model thereof to the X-ray 
diffraction pattern to generate a three-dimensional 

10 electron density map of at least a portion of the 

molecule or molecular conqplex whose structure is tinknown; 
and 

(d) generating a structural model of the 
molecule or molecular complex from the three-dimensional 

15 electron density map. 

[0175] In one eiribodiment, the method is performed 
using a computer. In another embodiment, the molecule is 
selected from the group consisting of a Syk catalytic 
domain protein, a Syk catalytic domain homologue, and a 

20 Syk protein. In another embodiment, the molecule is 
selected from the group consisting of a Syk catalytic 
domain protein complex, a Syk catalytic domain homologue 
complex, and a Syk protein complex. 

[0176] By using molecular replacement, all or part of 
25 the structure coordinates of the Sykcat as provided by 
this invention (amd set forth in Figure 1 or 2) can be 
used to determine the structure of a crystallized 
molecule or molecular complex whose structure is unknown 
more quickly and efficiently than attempting to determine 
30 such information ab initio. 

[0177] Molecular replacement provides an accurate 
estimation of the phases for an unknown structure. 
Phases are a factor in equations used to solve crystal 
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structures that can not be determined directly - 
Obtaining accurate values for the phases, by methods 
other than molecular replacement, can be a time-consuming 
process that involves iterative cycles of approximations 
5 and refinements and greatly hinders the solution of 

crystal structures. However, when the crystal structure 
of a protein containing at least a homologous portion has 
been solved, the phases from the known structure may 
provide a satisfactory estimate of the phases for the 

10 xinknown structure. 

[0178] Thus, this method involves generating a 
preliminary model of a molecule or molecular complex 
whose structure coordinates are \anknown, by orienting and 
positioning the relevant portion of the Sykcat according 

15 to Figure 1 or 2 or homology model thereof within the 
luiit cell of the crystal of the unknown molecule or 
molecular complex so as best to account for the observed 
X-ray diffraction pattern of the crystal of the molecule 
or molecular complex whose structure is unknown. Phases 

20 can then be calculated from this model and combined with 
the observed X-ray diffraction pattern amplitudes to 
generate an electron density map of the structure whose 
coordinates are \mknown. This, in turn, can be subjected 
to any well-known model building and structure refinement 

25 techniques to provide a final, accurate structure of the 
unknown crystallized molecule or molecular coinplex (E. 
Lattman, **Use of the Rotation and Translation Functions", 
in Meth. Enzymol., 115, pp. 55-77 (1985); M. G. Rossmann, 
ed. , "The Molecular Replacement Method", Int. Sci. Rev. 

30 Ser., No. 13, Gordon & Breach, New York (1972)). 
[0179] The structure of any portion of any 
crystallized molecule or molecular complex that is 
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sufficiently homologous to any portion of the Sykcat can 
be resolved by this method. 

[0180] In one embodiment, the method of molecular 
replacement is utilized to obtain structural information 
5 about a Sykcat homologue. The structure coordinates of 
SyJccat as provided by this invention are particularly 
useful in solving the structure of Sykcat complexes that 
are bound by ligands, substrates and inhibitors. 
[0181] Furthermore, the structure coordinates of Sykcat 

10 as provided by this invention are useful in solving the 
structure of Sykcat proteins that have amino acid 
substitutions, additions and/or deletions (referred to 
collectively as **Sykcat mutants", as compared to naturally 
occurring Sykcat) • These Sykcat mutants may optionally be 

15 crystallized in co-coinplex with a chemical entity, such 
as a non-hydrolyzable ATP analogue or a suicide 
substrate. The crystal structures of a series of such 
complexes may then be solved by molecular replacement and 
compared with that of wild- type Sylccat- Potential sites 

20 for modification within the various binding pockets of 
the enzyme may thus be identified. This information 
provides an additional tool for determining the most 
efficient binding interactions, for example, increased 
hydrophobic interactions, between Sykcat and a chemical 

25 entity or compound. 

[0182] The structure coordinates are also particularly 
useful in solving the structure of crystals of Sykcat or 
Sykcat homologues co-coinplexed with a variety of chemical 
entities. This approach enables the determination of the 

30 optimal sites for interaction between chemical entities, 
including candidate Sykcat inhibitors. For example, high 
resolution X-ray diffraction data collected from crystals 
exposed to different types of solvent allows the 
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determination of where each type of solvent molecule 
resides. Small molecules that bind tightly to those 
sites can then be designed and synthesized and tested for 
their Sykcat inhibition activity. 
5 [0183] All of the complexes referred to above may be 
studied using well-known X-ray diffraction techniques and 
may be refined using 1.2-3.4 A resolution X-ray data to 
an R value of about 0.30 or less using computer software, 
such as X-PLOR (Yale University, ©1992, distributed by 

10 Molecular Simulations, Inc.; see, e.g., Blundell & 

Johnson, supra; Meth. Enzymol., vol. 114 & 115, H. W. 
Wyckoff et al., eds.. Academic Press (1985)) or CNS 
(Brunger et al., Acta Cryst., D54, pp. 905-921, (1998)). 
[0184] In order that this invention be more fully 

15 understood, the following examples are set forth. These 
examples are for the purpose of illustration only and are 
not to be construed as limiting the scope of the 
invention in any way. 

20 Example 1: Expression and Purification of Sykcat 

[0185] Residues 343-635 of human Syk (accession number 
A53596) were cloned • PGR was carried out on the 
basophil-like leukemic cell line KU 812 (Kishi, K. , LeuJt. 
Res. 9, pp. 381-390 (1985)) using the following primers 

25 in order to clone Human Syk kinase domain cDNA: 

5'-CGCGGATCCGCCACCATGGACACAGAGGTGTACGAGAGC-3' (SEQ ID NO: 

3) 

and 

30 5 ' - CGGCGGATCCTTAATG ATGATGATGATGATGGTTC ACC ACGTC A 
TAGTAGTAATTGCG-3' (SEQ ID NO: 4). 
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[0186] The PGR amplicon was cloned directly into 
pFastBacl (Life Technologies) using BainHl to make the 
recombinant baculoviral shuttle vector pFB-CatSyk. The 
DNA sequence was verified using an Applied Biosystems 
5 sequencer. The PGR primer sequences introduced an 

optimal Kozak translational initiation sequence (Kozak, 
M., Nuc. Acids Res 12, pp. 857-872 (1984)) at the 5' end 
of the sequence coding for the catalytic domain of Syk 
(residues 343-635 of full-length SEQ ID NO: 1; GenBank 

10 accession nximber A53596) and a hexahistidine purification 
tag at the carboxyl- terminal end of Sykcat- SEQ ID NO: 1 
(GenBank accession number A53596) below shows the amino 
acid sequence of human Syk. Amino acid residue numbers 
in the text and Figures 1 and 2 follow the amino acid 

15 residue numbering system of the full-length Syk protein. 



SEQ ID NO: 1: 
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[0187] Transposition to generate Syk baculoviral DNA 
was performed in DHlOBac E.coli (Life Technologies) . 

20 Lipofection was then used to transform Sf9 insect cells 
with baculoviral DNA in order to generate seed 
baculoviral stocks. The baculoviral stocks were 
amplified four times in insect-free media (Insect Xpress, 
Biowhittaker) using shaker flasks at 27^C and 100 rpm. 

25 The fourth amplification viral stocks were used at a 

multiplicity of infection of 10 in 3 day infections of 
2x10^ cells /ml shaker cultures for protein production. 
[0188] Protein yields of 2-5 mg/1 intracellular 
protein were obtained for Sykcat after cobalt chelation 

30 chromatography/ FPLC purification. The buffer was 

exchanged to 20 iriM diethanolamine (pH 8.6), 500 .mM NaCl, 
and the protein was concentrated using microf iltration. 
The protein used in crystallizations of Sykcat complexed 
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with PT426 and adenylyl imidodiphosphate (AMP-PNP) was 
further purified using a Mono Q column (Pharmacia) • 



Example 2 ; 



Formation of Sykcat- inhibitor or Sykcaf 
Peptide-AMP-PNP Coinplex for 
Crystallization 



5 



Sykcat in Complex with Staurosporine 



[0189] 



Staurosporine, a microbial alkaloid from 



Strep tomyces sp, (Omuru et al . , J. Antibiot,, 48, pp. 
10 275-282 (1977) ) , is a potent broad-range kinase 
inhibitor. Sykcat protein (2-4 mg/mL) in 20 mM 
Diethanolamine at pH 8.6 and 0.5 M NaCl was combined with 
300 nM staurosporine . 

15 Sykcat in Complex with Peptide, PT426 

[0190] Sykcat protein (2-4 mg/mL) in diethylamine 
hydrochloride at pH 8.6 and 0.5 M NaCl was combined with 
500 /xM NAc-61u-Glu-Asp-Asp-Tyr-Glu-Ser-Pro-NH2 peptide 
(NAC-EEDDYESP-NH2/ or PT426; SEQ ID NO: 2) , 2 mM AMP-PNP 

20 and 6 mM MgCl2 . 

Example 3 ; Crystallization of Sykcat and Sykcat" 



the hanging-drop vapor diffusion method. Equal volumes 
of 2 mg/ml protein solution in 20 mM diethanolamine (pH 
8.6), 500 mM NaCl with 300 fxM staurosporine and a 
30 reservoir solution containing 20% PEG 2K, 0.2 ammonium 

acetate, 0.1 M sodium cacodylate (pH 5.23) were combined 
and placed over the reservoir containing reservoir 



inhibitor complexes thereof Syk Catalytic 
Domain in Complex with Staurosporine 



25 



[0191] 



Sykcat -staurosporine complex was crystallized by 
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solution. Plate like crystals began to form 24 hours 
later, and grew to a maximum size of 0.3x0.3x0,1 mm^ 
after 10 days . 

5 Sykcat in Complex with Peptide/ PT426 

[0192] Sykcat- PT42 6 -AMP- PNP complex was also 
crystallized by hanging-drop vapor diffusion method. 
Equal volumes of 3 mg/ml protein solution in 2 0 mM 
Diethanolamine (pH 8.6), 500 mM NaCl with 2 mM AMP-PNP, 6 

10 mM MgCl2 and 500 mM of the peptide PT426 and reservoir 

solution containing 22% PEG 2K, 0.2 M magnesium acetate, 
0.1 M sodium cacodylate (pH 5.23) were combined and 
placed over the reservoir. Plate- like crystals started 
to form after 1 day and grew to 0.3 x 0.3 x 0.1 mm^ after 

15 seven days. 

Example 4: X-Ray Data Collection and 

Structure Determination 
[0193] X-ray diffraction data was collected from Sykcat 

2 0 complex crystals at 100 K at European Synchrotron 

Radiation facility (ESRF) . Crystals were flash- frozen 
from cryosolution containing reservoir or well solution. 
[0194] Crystals of Sykcat- staurosporine were "annealed" 
before data collection. The term ''anneal" refers to 

25 allowing a previously cryogenically- cooled sample to warm 
in temperature (as seen by visible signs of thawing) 
before flash-cooling or flash- freezing the sample again. 
This annealing process may be performed once or repeated 
multiple times. Annealing is performed by blocking a 

30 cryocooling stream from a crystal sample mounted on a 

goniostat for several seconds before allowing the crystal 
to be exposed to the stream again. Annealing is used to 
reduce mosaicity and increase the limit of resolution 
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(Yeh and Hoi, Acta Cryst. D54, pp. 479-480 (1998); Harp 
et al.. Acta Cryst. D55, pp. 1129-1334 (1999); Harp et 
al. Acta Cryst., D54, pp. 622-628 (1998)). The annealing 
step may also reduce the static disorder of the crystal 
5 (Garman, Acta Cryst. D55 , pp. 1641-1653 (1999)). 

[0195] The data sets were processed with DENZO and 
SCALEPACK (Otwinowski, Z. & Minor, W. , Methods in 
Enzymol. 276, pp. 307-326 (1997)). Detailed information 
about data statistics is provided in Table 1 and Table 2 
10 for Sykcat-staurosporine and Sykcat-PT42 6-AMP-PNP 
complexes, respectively. All crystal lographic 
calculations were performed using the CCP4 program 
package CCP4 (The CCP4 suite: Programs for protein 
crystallography. Acta. Cryst. D50 , pp. 760-763 (1994)). 



wo 2004/035604 



PCTAJS2003/032812 



73 

Table 1 

Summary of data collection for the Sykcat- 
staurosporine complex. 





Sykcar-Staurosporine 


Crystal data 




Space group 


P2|2i2, 


Unit cell parameters (A) 


a=39.45 
b=84.17 
c=85.00 


Monomers per 
asymmetric unit 


1 


Solvent content (%) 


38.5 






Sy kcaHStau rosporine 


Data collection 




X-ray source 


ID14-4(ESRF) 


Image plate system 


Q4 CCD (ADSC) 


Wavelength (A) 


1.0055 


Data 




Resolution range (A) 
(overall/outer shell) 


35-1.65 (1.69-1.65) 


Total reflections 


679,250 


Unique reflections 


42,026 


Completeness (%) 
(overall/outer shell) 


99.0 (90.0) 


I/O (I) 

(overall/outer shell) 


24.6 (4.2) 


R merge (%)* 
(overall/outer shell) 


4.2 (18.2) 



^merge - 
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Table 2 

Summary of data collection for the Sykcat-PT42 6- 
AMP-PNP complex. 





"^y Kcar * 1 4ZO- AMJr-FJN r 


Crystal data 




Space group 


P2i2|2, 


Unit cell parameters 
(A) 


a=39.58 
b=84.67 
c=90.63 


Monomers per 
asynmietric unit 


1 


Solvent content (%) 


43.1 






SykcarPT426-AMP-PNP 


Data collection 




X-ray source 


ID14-4(ESRF) 


Image plate system 


Q4 CCD (ADSC) 


Wavelength (A) 


0.95 


Data 




Resolution range (A) 
(overall/outer shell) 


40 - 2.4(2.44 - 2.40) 


Total reflections 


314 575 


Unique reflections 


12 559 


Completeness (%) 
(overall/outer shell) 


99.4 (90.9) 


I/O (I) 

(overall/outer shell) 


32.2 (9.5) 


R merge (%)* 
(overall/outer shell) 


9.1 (2.5) 



^merge - 
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Sykcat In Complex With Staurosporine 

[0196] The Sykcat- staurosporine structure was solved by 
molecular replacement. Molecular replacement 
calculations were performed with the program AMoRe as 
5 implemented in the CCP4 suite (Navaza, Acta. Cryst. D50, 
pp. 157-163 (1993)). A homology model of the catalytic 
domain of Syk (residues Lys368 to Asn635) was used as a 
search model in molecular replacement. This Sylccat 
homolocQT model was built predominantly using the 

10 structure of the Lck (LymphoCyte-specif ic Kinase) protein 
kinase structure (PDB accession code IQPJ) with a program 
Modeler™ (Accelrys) . A rotation search and a subseccuent 
translation search (resolution ranges 10.0 - 2.8 A) 
produced a single top solution (a = 9.10, p = 53.83, y = 

15 304.71, X = 0.09, y = 0.132 and z = 0.211) with a 

correlation coefficient of 76.2% and an R factor of 
35.9%. 

[0197] The initial molecular replacement solution 
featured only amino acid residues present in the original 

20 search model- The automated refinement program ARP/wARP 
(Lamzin & Wilson, Acta Cryst., D49, pp. 129-147 (1993); 
Lcumzin & Wilson, Meth. Enzym. , 211, pp. 269-305 (1997)) 
was used to improve the electron density maps and build 
in more amino acid residues and solvent molecules. 

25 [0198] The output model from ARP/wARP was then further 
refined using the maximum likelihood approach implemented 
in RefMac 5.0 (Murshudov, et al., "Application of Maximxim 
Likelihood Refinement" in the Refinement of Protein 
structures. Proceedings of Dareshury Study Weekend 

30 (1996); Murshudov, et al.. Acta Cri^st., D53, pp. 240^ 

255(1997); Murshudov, et al.. Acta Cryst., D55 pp. 247- 
255 (1999) ) . After five cycles of restrained isotropic 
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refinement, the resulting R factor was 19.4% and Rfree was 
23%. 

[0199] Electron density maps built after RefMac cycles 
were viewed using QUANTA (Accelrys ©2001,2002) and model 
5 building was performed using a QUANTA module X-AUTOFIT 
(Oldfield, Proceedings from the 1996 Meeting of the 
International Union of Crystallography Macromolecular 
Computing School; see http://www.sdsc.edu/Xtal/IUCr/ 
CC/School96/; Accelrys ©2001,2002). Graphical remodeling 

10 steps included mutating amino acid side chains of the 
Sykcat homology model to the Sykcat residue side chains, 
and rebuilding the loop and disordered regions. Sykcat 
amino acid residues Asn406 to Pro411 were not built into 
the model because the electron density was weak in these 

15 regions . 

[0200] Electron density maps showed the presence of 
extra electron density in the shape of a flat, multi- 
cyclic ligand located in the cleft between the N-terminal 
and C- terminal domains, where the putative ATP-binding 

20 site is located. Staurosporine was built into the model 
using geometric restraints inferred by modeling 
staurosporine into the Sykcat homology model used earlier 
for molecular replacement, and restraints of the crystal 
structure of Csk (C-terminal Src Kinase) complexed with 

25 staurosporine (PDB accession code IBYG) . 

[0201] After minimal model building, the ligand fit 
well with the extra electron density present in the ATP- 
binding site. Final refinement was done by using the 
program RefMac interspersed with manual model building 

30 using X-AUTOFIT in QUANTA (Oldf ield, supra) . 

[0202] For the Sykcat- staurosporine complex, the final 
Rworking and Rfree was 16.4% and 19.5%, respectively. 5.0% 
of data was used for the test set in the calculation of 
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Rfree- The final model contains all amino acid residues of 
Sykcat except the first 15 N-terminal amino acid residues 
and amino acid residues Asn406-Asp410 , 374 water 
molecules, four amino acid residues from the N-terminal 
5 end of a neighboring Sykcat molecule, and one 

staurosporine. No tyrosines amino acid residues were 
phosphorylated in the final model. 

[0203] The final model was validated using programs 
PROCHECK (Laskowski et al . , J. Appl . Cryst., 26, pp. 283- 

10 291 (1993); Morris et al . , Proteins, 12, pp. 345-364 

(1992)) and SQUID VALIDATE (Molecular Simulations, Inc., 
San Diego, CA ©1998, 2000; Accelrys ©2001,2002). A 
Ramachandran plot for the resulting Sykcat- staurosporine 
complex showed 92.5% of all non-Gly, non-Pro residues in 

15 the ^^core" or most favorable regions, 7.1% in ^^allowed" 
regions, and one residue in the **generally allowed" 
regions . 



Sykcat In Complex With Peptide^ PT426 

20 [0204] The Sykcat -PT42 6 -AMP- PNP structure was solved by 
molecular replacement using the structure of the Sykcat- 
staurosporine complex described above, without ligand or 
solvent molecules, as the search model. A rotation 
search followed by a translation search (resolution 

25 ranges 10.0 - 3.0 A) produced a single top solution with 
a correlation coefficient of 63% and an R- factor of 42.4% 
after refinement in AmoRe (Navaza, J., Acta. Cryst. D50, 
pp. 157-163 (1993)). The molecular replacement solution 
corresponds to the rotation and translation function a = 

30 9.00, |3 = 50.80, y = 306.77, x = 0.110, y = 0.134 and z = 
0.212 

[0205] Refinement was performed using the maximum 
likelihood approached implemented in RefMac (Murshudov, 
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et alw "Application of Maximum Likelihood Refinement" in 
the Refinement of Protein structures. Proceedings of 
Daresbury Study Weekend (1996); Murshudov, et al . , Acta 
Cryst., D53, pp. 240-255(1997); Murshudov, et al , , Acta 
5 Cryst., D55 pp. 247-255 (1999)). Resulting electron 

density maps were viewed in QUANTA (Accelrys ®2001,2002) 
and model building was carried out using X-AUTOFIT 
(Oldf ield, supra) . Iterative cycles of model building 
and refinement were used to improve electron density 

10 maps. No model was built for Sykcat amino acid residues 
Lys405-Pro411 because of weak electron density. 
[0206] Difference density maps showed tetrahedral 
bulges of density around the end of tyrosine residues 
Tyr525 and Tyr526, suggesting that these amino acid 

15 residues were phosphorylated. Phosphotyrosine residues 
(PTyr525 and PTyr526) were modeled into these positions. 
An AMP-PNP molecule was built into a region of electron 
density in the ATP-binding site. 

[0207] Additional electron density was apparent in the 
20 region around amino acid residues His531 to Pro535. 

These residues are located in the substrate-binding site. 
Density for a tyrosine amino acid side chain was visible. 
Amino acid residues E, D, D, and Y were built into the 
additional density. The remaining density was weak and 
25 difficult to fit. The weak density may result from a 
flexible or disordered region in the molecule or less 
than 100% occupancy of the peptide in the binding pocket. 
[0208] Final refinement was done by using the program 
RefMac interspersed with manual model building using X- 
30 AUTOFIT in QUANTA (Oldf ield, supra) . The final model of 
the Sykcat- PT4 2 6 -AMP-PNP complex has a final Rworking and 
Rfree of 19.8% and 28.8%, respectively. 4.8% of data was 
used for the test set in calculating the R£ree- The final 
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model contained the catalytic domain of Syk including 271 
water molecules, PT426, AMP-PNP, and two Mg*^ atoms. The 
first 21 N- terminal residues, the last C- terminal 
residue, the loop residues Asn381, Phe382, and Lys405- 
5 Pro411 of SyJccat are not present in the model. These 

regions have weak density and can not be modeled. Tyr525 
and Tyr526 were phosphorylated in the final model. 
[0209] The final model was validated using programs 
PROCHECK (Laskowski et al . , J". Appl. Cryst., 26, pp. 283- 

10 291 (1993); Morris et al . , Proteins, 12, pp. 345-364 

(1992)) and SQUID VALIDATE (Molecular Simulations, Inc., 
San Diego, CA ®1998, 2000; Accelrys ®2001,2002). A 
Ramachandran plot for the resulting Sykcat-PT426-AMP-PNP 
complex showed 87.9% of all non-Gly, non-Pro residues in 

15 the "core" or most favorable regions, 10.4% in "allowed" 
regions, 13% in the "generally allowed" regions and 0.4% 
(1 residue) in disallowed regions. 

Example 5 ; Overall Structure of Syk Catalytic Domain 

[0210] Syk family tyrosine kinases contain a C- 
terminal catalytic domain and tandem N- terminal SH2 
domains. The present invention provides for the crystal 
structure of the C-terminal catalytic domain of Syk 
(Sykcat) (Figures 3 and 4) . The conventional nomenclature 
for PK secondary structural elements are used in Figures 
3 and 4 (Knighton et al . , (1991); Hubbard et al. (1994)). 
The N- terminal lobe or sub-domain of the catalytic domain 
contains a curled iS- sheet of five anti -parallel jS- strands 
ifil' fiS) and one Of-helix (aC) positioned between the P3 
and j84 strands. The C-terminal lobe or sub-domain 
conprises four jS- strands {(31, ^8 , /89, and iSlO) and eight 
helices {aD, aE, aEF, aF , aG, aH, otUl, and al) . 



20 



25 



30 
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[0211] The two phosphorylated tyrosine residues in the 
Sykcat-PT426-AMP-PNP structure, PTyr525 and PTyr526, are 
found in the activation loop (amino acid residues L515 to 
Y539) , which is present in the kinase domains of most 
5 PTKs. These tyrosine residues have been previously 
identified as sites of autophosphorylation within the 
activation loop of murine Syk (Furlong et al . , Biochim. 
Biophys. Acta, 1355, pp. 177-190 (1997); Zhang et al . , J. 
Biol. Chem,, 215, pp, 35442-35447, (2000)). Lys517, a 
10 highly conserved residue in the PTK family, is in close 
proximity of PTyr526, PTyr525 is in close proximity of 
Lys548. 

[0212] The structures of the two Sykcat complexes in 
the present invention have highly similar structures. 

15 When the two structures are superimposed, the regions of 
highest RMSDs occur in specific regions within the 
catalytic domains including strand 3 and helix C in the 
N- terminal lobe, helices P and H of the C- terminal lobe, 
the sides of the nucleotide binding site, and the 

20 substrate binding site and activation loop. 

Example 6 s Catalytic Active Site of 

Sykcat Complexes 
[0213] The inhibitor staurosporine binds in a 

25 hydrophobic cleft between the N- and C-terminal lobes of 
the Sykcat structure (Figures 3 and 5) . Staurosporine 
forms hydrogen bond interactions with Glu449, Ala451, and 
Arg498. Staurosporine also forms hydrophobic 
interactions with cleft residue side chains. 

30 [0214] In the Sykcat- PT42 6 -AMP- PNP complex structure, 

AMP-PNP binds in the nucleotide -binding site that is 
situated between the N- and C- terminal lobes (Figure 4) . 
Interactions between AMP-PNP and Sykcat protein include 
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hydrogen bonds with residues Ser379, Glu449, Asp512 and 
hydrophobic interactions with cleft residue side chains. 
Additionally, some interaction exists between AMP-PNP and 
Lys402 . The two divalent magnesium ions are coordinated 
5 with negatively charged residues, a- and jS- phosphate 
groups and/or water molecules. Mgl from Figure 2 makes 
contacts to the a- and phosphate groups and residues 
Asn499 and Asp512 . Mg2 from Figure 2 coordinates to the 
/S- phosphate group of AMP-PNP, residue Glu420 and water 
10 molecule 121, which forms a bridge to residue Glu416. In 
turn, the a- and 0- phosphate groups of AMP-PNP interact 
with the magnesium ions, Sykcat amino acid residues 
Lys402, Asn499, Asp512, and a water molecule. 

15 Example 7 ; Substrate -Binding Site of Sylccat Coiig)lexes 

[0215] In the Sykcat-PT426-AMP-PNP complex, supporting 
electron density was seen for the backbone of only four 
of the eight PT42 6 residues. These amino acid residues 

2 0 were initially built in the PT426 structural model as E, 
D, D, Y. The exact identity of these residues can not be 
known for certain since the electron density for all the 
side chains in the electron density was weak in the 
peptide region. In addition, valine was inadvertently 

25 left as the first amino acid residue in the PT426 model 
as a remnant of peptide model building. Although valine 
was given in the structure coordinates listed in Figure 
2,. it is not part of the PT426 peptide. 
[0216] These factors are not expected to affect the 

30 scope or utility of the present invention because 

electron density is visible for the backbone of four 
amino acid residues in the peptide. The visible PT426 
tyrosine residue is the acceptor tyrosine residue for 
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phosphorylation. The PT426 peptide forms various 
hydrogen bonds with amino acid residues Gly532, Lys533, 
Trp534 and Pro535 at the end of the activation loop. The 
hydroxy 1 group of the tyrosine in the peptide forms a 
5 hydrogen bond with the carboxylate group of Asp4 94, 

[0217] In the structure of the Sykcat-staurosporine 
complex, unexpected electron density appeared at the 
substrate binding site. No substrate was added to the 
Sykcat-staurosporine complex crystallization, but the N- 
10 terminal end of Sykcat could be built into this unexpected 
density. The N- terminal region of the Syk catalytic 
domain mimics the sequence of PT426: 

N-term EEDDYESP unmodified PT426 

15 (SEQ ID NO: 5) 

N-term DTEVYESP Syk (amino acid 

residues 344-351 of 
SEQ ID N0:1) 



20 The N-terminal region of Sykcat also contains Tyr348, one 
of the proposed major sites of autophosphorylation of Syk 
(Furlong et al . , Biochim. Biophys . Acta, 1355, pp. 177- 
190 (1997)). In the crystal, the N-terminal end of the 
Sykcat packs against the substrate binding pocket of the 

25 neighboring Sykcat molecule, and it extends within a very 
close distance (-20 A) to the peptide moiety in the 
substrate binding pocket of the Sykcat-PT426 complex 
structure. This distance could easily be bridged by 
eight residues between the visible N- terminus of the 

30 Sykcat -Staurosporine complex (Ile358)and the first visible 
residue of the bound peptide (Ser350, if it were an 
extension of the N- terminus) . The presence of the N- 
teirminal end of the Sykcat domain of Syk bound in the 
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substrate binding site either may be the structure of the 
Syk molecule at the time of autophosphorylation or a 
result of crystal packing that forces the N-terminal end 
of Sykcat into the substrate binding site of the Sykcat- 
5 staurosporine complex. 



